Regularizing activations in neural networks via distribution matching with the Wasserstein metric

Full metadata record
DC Field | Value | Language
dc.contributor.author | Joo, Taejong | -
dc.contributor.author | Kang, Donggu | -
dc.contributor.author | Kim, Byunghoon | -
dc.date.accessioned | 2021-06-22T09:04:56Z | -
dc.date.available | 2021-06-22T09:04:56Z | -
dc.date.issued | 2020-04 | -
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/1142 | -
dc.description.abstract | Regularization and normalization have become indispensable components in training deep neural networks, resulting in faster training and improved generalization performance. We propose the projected error function regularization loss (PER) that encourages activations to follow the standard normal distribution. PER randomly projects activations onto one-dimensional space and computes the regularization loss in the projected space. PER is similar to the Pseudo-Huber loss in the projected space, thus taking advantage of both $L^1$ and $L^2$ regularization losses. In addition, PER can capture the interaction between hidden units through projection vectors drawn from a unit sphere. By doing so, PER minimizes the upper bound of the Wasserstein distance of order one between an empirical distribution of activations and the standard normal distribution. To the best of the authors' knowledge, this is the first work to regularize activations via distribution matching in the probability distribution space. We evaluate the proposed method on the image classification task and the word-level language modeling task. | -
dc.format.extent | 13 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | International Conference on Learning Representations | -
dc.title | Regularizing activations in neural networks via distribution matching with the Wasserstein metric | -
dc.type | Article | -
dc.publisher.location | United States | -
dc.identifier.bibliographicCitation | International Conference on Learning Representations 2020, pp 1 - 13 | -
dc.citation.title | International Conference on Learning Representations 2020 | -
dc.citation.startPage | 1 | -
dc.citation.endPage | 13 | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | foreign | -
dc.subject.keywordPlus | Computer Science - Machine Learning | -
dc.subject.keywordPlus | Statistics - Machine Learning | -
dc.identifier.url | https://arxiv.org/abs/2002.05366 | -
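
The idea described in the abstract (project activations onto random one-dimensional directions, then penalize them with an error-function-based loss) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the one-dimensional penalty is the closed-form expectation E|x - Z| for Z ~ N(0, 1), which is roughly quadratic near zero and roughly linear far from it, much like a Pseudo-Huber loss. The function names, batch shape, and number of projections are hypothetical; the exact PER definition should be taken from the paper linked in dc.identifier.url.

```python
import math
import random


def expected_abs_gap(x):
    """Closed form of E|x - Z| for Z ~ N(0, 1) (assumed stand-in for the 1-D penalty)."""
    return (x * math.erf(x / math.sqrt(2.0))
            + math.sqrt(2.0 / math.pi) * math.exp(-0.5 * x * x))


def random_unit_vector(dim, rng):
    """Draw a direction uniformly from the unit sphere by normalizing Gaussian samples."""
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]


def projected_loss(activations, num_projections, rng):
    """Average the 1-D penalty over random projections of every activation vector."""
    dim = len(activations[0])
    total = 0.0
    for _ in range(num_projections):
        theta = random_unit_vector(dim, rng)
        for h in activations:
            # Inner product <theta, h>: the activation seen along one random direction.
            proj = sum(t * a for t, a in zip(theta, h))
            total += expected_abs_gap(proj)
    return total / (num_projections * len(activations))


# Hypothetical toy data: 32 activation vectors with 8 hidden units each.
rng = random.Random(0)
batch = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(32)]
shifted = [[a + 3.0 for a in h] for h in batch]

loss_standard = projected_loss(batch, 16, rng)
loss_shifted = projected_loss(shifted, 16, rng)
```

In this sketch, activations that already resemble standard normal samples receive a smaller penalty than the same activations shifted away from zero. Because each projection mixes all hidden units, the penalty depends on their joint behavior and not only on each unit separately.
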
Appears in Collections: COLLEGE OF ENGINEERING SCIENCES > DEPARTMENT OF INDUSTRIAL & MANAGEMENT ENGINEERING > 1. Journal Articles

Related Researcher

Kim, Byunghoon
ERICA College of Engineering (DEPARTMENT OF INDUSTRIAL & MANAGEMENT ENGINEERING)