Detailed Information


Compression of Deep-Learning Models Through Global Weight Pruning Using Alternating Direction Method of Multipliers

Full metadata record
dc.contributor.author: Lee, Kichun
dc.contributor.author: Hwangbo, Sunghun
dc.contributor.author: Yang, Dongwook
dc.contributor.author: Lee, Geonseok
dc.date.accessioned: 2023-11-14T02:00:08Z
dc.date.available: 2023-11-14T02:00:08Z
dc.date.created: 2023-03-08
dc.date.issued: 2023-02
dc.identifier.issn: 1875-6891
dc.identifier.uri: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/192070
dc.description.abstract: Deep learning has shown excellent performance in numerous machine-learning tasks, but one practical obstacle in deep learning is that the amount of computation and memory required is huge. Model compression, especially in deep learning, is very useful because it saves memory and reduces storage size while maintaining model performance. Model compression in a layered network structure aims to reduce the number of edges by pruning weights that are deemed unnecessary during the calculation. However, existing weight-pruning methods perform a layer-by-layer reduction, which requires a predefined removal-ratio constraint for each layer. These layer-by-layer removal ratios must be specified for each structure and task, causing a sharp increase in training time due to the large number of tuning parameters. Thus, such a layer-by-layer strategy is hardly feasible for deep layered models. Our proposed method performs weight pruning in a deep layered network while maintaining similar performance, by setting a single global removal ratio for the entire model without prior knowledge of its structural characteristics. Our experiments show reliable, high-quality performance while obviating layer-by-layer removal ratios. Furthermore, experiments with increasing numbers of layers reveal a pattern in the pruned weights that provides insight into the layers' structural importance. In the LeNet-5 experiment on MNIST data, the proposed method achieves a compression ratio of 98.8%, outperforming existing pruning algorithms. In the ResNet-56 experiment, we investigate the performance change across removal ratios of 10–90% and achieve a higher removal ratio than the other tested models. We also demonstrate the effectiveness of the proposed method on YOLOv4, a real-life object-detection model requiring substantial computation. (An illustrative code sketch of global weight pruning with ADMM appears after this record.)
dc.language: English
dc.language.iso: en
dc.publisher: Springer Science and Business Media B.V.
dc.title: Compression of Deep-Learning Models Through Global Weight Pruning Using Alternating Direction Method of Multipliers
dc.type: Article
dc.contributor.affiliatedAuthor: Lee, Kichun
dc.identifier.doi: 10.1007/s44196-023-00202-z
dc.identifier.scopusid: 2-s2.0-85148941525
dc.identifier.wosid: 000939369800001
dc.identifier.bibliographicCitation: International Journal of Computational Intelligence Systems, v.16, no.1, pp.1 - 13
dc.relation.isPartOf: International Journal of Computational Intelligence Systems
dc.citation.title: International Journal of Computational Intelligence Systems
dc.citation.volume: 16
dc.citation.number: 1
dc.citation.startPage: 1
dc.citation.endPage: 13
dc.type.rims: ART
dc.type.docType: Article
dc.description.journalClass: 1
dc.description.isOpenAccess: Y
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Computer Science
dc.relation.journalWebOfScienceCategory: Computer Science, Artificial Intelligence
dc.relation.journalWebOfScienceCategory: Computer Science, Interdisciplinary Applications
dc.subject.keywordPlus: Layer by layer
dc.subject.keywordPlus: Layer removal
dc.subject.keywordPlus: Layered network
dc.subject.keywordPlus: Model compression
dc.subject.keywordPlus: Network compression
dc.subject.keywordPlus: Nonconvex optimization
dc.subject.keywordPlus: Parallel computing
dc.subject.keywordPlus: Performance
dc.subject.keywordPlus: Removal ratios
dc.subject.keywordPlus: Weight pruning
dc.subject.keywordAuthor: Network compression
dc.subject.keywordAuthor: Non-convex optimization
dc.subject.keywordAuthor: Parallel computing
dc.subject.keywordAuthor: Weight pruning
dc.identifier.url: https://link.springer.com/article/10.1007/s44196-023-00202-z
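
To make the abstract's global-pruning idea concrete, the following is a minimal sketch, assuming PyTorch. It is not the authors' reference implementation: the helper names (global_magnitude_project, admm_prune) and the hyperparameters (rho, admm_steps, lr, removal_ratio) are illustrative assumptions. The sketch alternates ADMM updates, with a single global magnitude threshold replacing per-layer removal ratios.

```python
# Illustrative sketch only: global weight pruning via ADMM, assuming PyTorch.
# Helper names and hyperparameters are hypothetical, not taken from the paper.
import torch


def global_magnitude_project(tensors, removal_ratio):
    """Zero out the globally smallest-magnitude entries across all tensors,
    using one removal ratio for the whole model (no per-layer ratios)."""
    flat = torch.cat([t.reshape(-1).abs() for t in tensors])
    k = int(removal_ratio * flat.numel())
    if k == 0:
        return [t.clone() for t in tensors]
    threshold = flat.kthvalue(k).values  # single global magnitude cutoff
    return [torch.where(t.abs() > threshold, t, torch.zeros_like(t))
            for t in tensors]


def admm_prune(model, loss_fn, data_loader, removal_ratio=0.9,
               rho=1e-3, admm_steps=10, lr=1e-3):
    """Alternate: (1) train W under a quadratic penalty pulling it toward the
    sparse variable Z, (2) project Z = W + U onto the globally sparse set,
    (3) dual update U += W - Z. Finally, hard-prune W with Z's mask."""
    weights = [p for p in model.parameters() if p.dim() > 1]
    Z = global_magnitude_project([w.detach().clone() for w in weights],
                                 removal_ratio)
    U = [torch.zeros_like(w) for w in weights]
    opt = torch.optim.SGD(model.parameters(), lr=lr)
    for _ in range(admm_steps):
        for x, y in data_loader:               # W-update: one training pass
            opt.zero_grad()
            loss = loss_fn(model(x), y)
            for w, z, u in zip(weights, Z, U):  # (rho/2) * ||W - Z + U||^2
                loss = loss + (rho / 2) * torch.sum((w - z + u) ** 2)
            loss.backward()
            opt.step()
        with torch.no_grad():                  # Z-update and dual update
            Z = global_magnitude_project(
                [w + u for w, u in zip(weights, U)], removal_ratio)
            for u, w, z in zip(U, weights, Z):
                u += w - z
    with torch.no_grad():                      # final hard prune of W
        for w, z in zip(weights, Z):
            w.mul_((z != 0).to(w.dtype))
```

The design point the sketch illustrates is the one the abstract emphasizes: because the threshold in global_magnitude_project is computed over the concatenation of all weight tensors, a single removal ratio governs the entire model, and no per-layer removal ratios need to be tuned.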
Appears in Collections
Seoul College of Engineering > Seoul Department of Industrial Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Lee, Kichun
COLLEGE OF ENGINEERING (DEPARTMENT OF INDUSTRIAL ENGINEERING)
