Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Implicit Jacobian Regularization Weighted with Impurity of Probability Output

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Sungyoon-
dc.contributor.authorPark, Jinseong-
dc.contributor.authorLee, Jaewook-
dc.date.accessioned2023-11-14T08:15:09Z-
dc.date.available2023-11-14T08:15:09Z-
dc.date.created2023-11-01-
dc.date.issued2023-07-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/192170-
dc.description.abstractThe success of deep learning is greatly attributed to stochastic gradient descent (SGD), yet it remains unclear how SGD finds well-generalized models. We demonstrate that SGD has an implicit regularization effect on the logit-weight Jacobian norm of neural networks. This regularization effect is weighted with the impurity of the probability output, and thus it is active in a certain phase of training. Moreover, based on these findings, we propose a novel optimization method that explicitly regularizes the Jacobian norm, which leads to similar performance as other state-of-the-art sharpness-aware optimization methods.-
dc.language영어-
dc.language.isoen-
dc.publisherML Research Press-
dc.titleImplicit Jacobian Regularization Weighted with Impurity of Probability Output-
dc.typeArticle-
dc.contributor.affiliatedAuthorLee, Sungyoon-
dc.identifier.scopusid2-s2.0-85174424115-
dc.identifier.bibliographicCitationProceedings of Machine Learning Research, v.202, pp.19094 - 19140-
dc.relation.isPartOfProceedings of Machine Learning Research-
dc.citation.titleProceedings of Machine Learning Research-
dc.citation.volume202-
dc.citation.startPage19094-
dc.citation.endPage19140-
dc.type.rimsART-
dc.type.docTypeConference paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordPlusDeep learning-
dc.subject.keywordPlusGradient methods-
dc.subject.keywordPlusOptimization-
dc.subject.keywordPlusStochastic models-
dc.identifier.urlhttps://proceedings.mlr.press/v202/lee23q.html-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Sungyoon photo

Lee, Sungyoon
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE