FIND A WINNING SIGN: SIGN IS ALL WE NEED TO WIN THE LOTTERY

Oh, Junghun; Baik, Sungyong; Lee, Kyoung Mu

doi:10.48550/arXiv.2504.05357

Full metadata record

DC Field	Value	Language
dc.contributor.author	Oh, Junghun	-
dc.contributor.author	Baik, Sungyong	-
dc.contributor.author	Lee, Kyoung Mu	-
dc.date.accessioned	2025-08-12T07:00:10Z	-
dc.date.available	2025-08-12T07:00:10Z	-
dc.date.issued	2025-04	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208495	-
dc.description.abstract	The Lottery Ticket Hypothesis (LTH) posits the existence of a sparse subnetwork (a.k.a. winning ticket) that can generalize comparably to its over-parameterized counterpart when trained from scratch. The common approach to finding a winning ticket is to preserve the original strong generalization through Iterative Pruning (IP) and transfer information useful for achieving the learned generalization by applying the resulting sparse mask to an untrained network. However, existing IP methods still struggle to generalize their observations beyond ad-hoc initialization and small-scale architectures or datasets, or they bypass these challenges by applying their mask to trained weights instead of initialized ones. In this paper, we demonstrate that the parameter sign configuration plays a crucial role in conveying useful information for generalization to any randomly initialized network. Through linear mode connectivity analysis, we observe that a sparse network trained by an existing IP method can retain its basin of attraction if its parameter signs and normalization layer parameters are preserved. To take a step closer to finding a winning ticket, we alleviate the reliance on normalization layer parameters by preventing high error barriers along the linear path between the sparse network trained by our method and its counterpart with initialized normalization layer parameters. Interestingly, across various architectures and datasets, we observe that any randomly initialized network can be optimized to exhibit low error barriers along the linear path to the sparse network trained by our method by inheriting its sparsity and parameter sign information, potentially achieving performance comparable to the original. The code is available at https://github.com/JungHunOh/AWS_ICLR2025.git.	-
dc.format.extent	15	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	International Conference on Learning Representations, ICLR	-
dc.title	FIND A WINNING SIGN: SIGN IS ALL WE NEED TO WIN THE LOTTERY	-
dc.type	Article	-
dc.identifier.doi	10.48550/arXiv.2504.05357	-
dc.identifier.scopusid	2-s2.0-105010212939	-
dc.identifier.bibliographicCitation	13th International Conference on Learning Representations, ICLR 2025, pp 26059 - 26073	-
dc.citation.title	13th International Conference on Learning Representations, ICLR 2025	-
dc.citation.startPage	26059	-
dc.citation.endPage	26073	-
dc.type.docType	Conference paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.identifier.url	https://arxiv.org/abs/2504.05357	-
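
The abstract's two central notions, inheriting sparsity and parameter signs from a trained sparse network and measuring the error barrier along a linear path between two networks, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation (their code is at the repository linked in the abstract). The function names `inherit_sign_and_mask` and `error_barrier` are hypothetical, weights are assumed to be flattened into a single array, and the barrier follows one common definition: the largest excess of the loss on the path over the linear interpolation of the endpoint losses.

```python
import numpy as np


def inherit_sign_and_mask(init_weights, trained_sparse_weights):
    """Give a fresh initialization the sparsity pattern and signs of a trained sparse network.

    Hypothetical helper for illustration only; not taken from the paper's code.
    """
    mask = trained_sparse_weights != 0        # surviving (unpruned) positions
    signs = np.sign(trained_sparse_weights)   # -1, 0, or +1 per parameter
    # Keep the random magnitudes, take the signs from the trained network,
    # and zero out every pruned position.
    return np.abs(init_weights) * signs * mask


def error_barrier(loss_fn, theta_a, theta_b, num_points=11):
    """Largest gap between the loss on the linear path from theta_a to theta_b
    and the linear interpolation of the two endpoint losses (assumed definition)."""
    loss_a, loss_b = loss_fn(theta_a), loss_fn(theta_b)
    gaps = []
    for alpha in np.linspace(0.0, 1.0, num_points):
        theta = (1 - alpha) * theta_a + alpha * theta_b
        baseline = (1 - alpha) * loss_a + alpha * loss_b
        gaps.append(loss_fn(theta) - baseline)
    return max(gaps)


# Toy usage with made-up numbers.
trained = np.array([0.5, 0.0, -1.2, 0.0])   # zeros mark pruned weights
fresh = np.array([-0.3, 0.7, 0.1, -0.9])    # a random initialization
signed_init = inherit_sign_and_mask(fresh, trained)


def double_well(theta):
    """Toy non-convex loss with minima at -1 and +1 in every coordinate."""
    return float(np.sum((theta ** 2 - 1) ** 2))


barrier = error_barrier(double_well, np.array([-1.0]), np.array([1.0]))
```

In this toy setting a low barrier means the two parameter vectors sit in the same basin of the loss, which is the sense in which the abstract speaks of low error barriers between a newly optimized network and the sparse reference network.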

Appears in Collections: Seoul College of Engineering > ETC > 1. Journal Articles

Related Researcher

Baik, Sungyong: COLLEGE OF ENGINEERING (DEPARTMENT OF INTELLIGENCE COMPUTING)
