KERNTROL: Kernel Shape Control Toward Ultimate Memory Utilization for In-Memory Convolutional Weight Mapping
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Rhe, Johnny | - |
dc.contributor.author | Jeon, Kang Eun | - |
dc.contributor.author | Lee, Joo Chan | - |
dc.contributor.author | Jeong, Seongmoon | - |
dc.contributor.author | Ko, Jong Hwan | - |
dc.date.accessioned | 2024-04-08T23:00:34Z | - |
dc.date.available | 2024-04-08T23:00:34Z | - |
dc.date.issued | 2024-02-22 | - |
dc.identifier.issn | 1549-8328 | - |
dc.identifier.issn | 1558-0806 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/110278 | - |
dc.description.abstract | Processing-in-memory (PIM) architectures have been highlighted as one of the most viable options for faster and more power-efficient computation. Paired with a convolutional weight mapping scheme, PIM arrays can accelerate various deep convolutional neural networks (CNNs) and the applications that adopt them. Recently, the shift and duplicate kernel (SDK) convolutional weight mapping scheme was proposed, achieving up to 50% throughput improvement over prior art. However, traditional pattern-based pruning methods, which were adopted for row-skipping and computing cycle reduction, are not optimal for the latest SDK mapping due to the loss of structural regularity caused by the shifted and duplicated kernels. To address this challenge, we propose kernel shape control (KERNTROL), a method where kernel shapes are controlled depending on their mapped columns with the purpose of fostering a structural regularity that is favorable for achieving a high row-skipping ratio and model accuracy. Instead of permanently pruning the weights, KERNTROL with an empty mask (KERNTROL-M) temporarily omits them in underutilized rows using a utilization threshold, thereby preserving important weight elements. However, a significant portion of the memory cells is still underutilized where the threshold is not enforced. To overcome this, we extend KERNTROL-M into KERNTROL with compensatory weights (KERNTROL-C). By populating idle cells with compensatory weights, KERNTROL-C can offset the accuracy drop from weight omission. In comparison to pattern-based pruning approaches, KERNTROL-C achieves simultaneous improvements of up to 36.4% in the compression rate and 5% in model accuracy, with up to 100% array utilization. | - |
dc.format.extent | 14 | - |
dc.language | English | - |
dc.language.iso | ENG | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | KERNTROL: Kernel Shape Control Toward Ultimate Memory Utilization for In-Memory Convolutional Weight Mapping | - |
dc.type | Article | - |
dc.publisher.location | United States | - |
dc.identifier.doi | 10.1109/TCSI.2024.3365175 | - |
dc.identifier.scopusid | 2-s2.0-85186082215 | - |
dc.identifier.wosid | 001178999900001 | - |
dc.identifier.bibliographicCitation | IEEE Transactions on Circuits and Systems I: Regular Papers, pp 1 - 14 | - |
dc.citation.title | IEEE Transactions on Circuits and Systems I: Regular Papers | - |
dc.citation.startPage | 1 | - |
dc.citation.endPage | 14 | - |
dc.type.docType | Article; Early Access | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.subject.keywordPlus | ARCHITECTURE | - |
dc.subject.keywordPlus | PRECISION | - |
dc.subject.keywordPlus | EFFICIENT | - |
dc.subject.keywordAuthor | Computer architecture | - |
dc.subject.keywordAuthor | Convolution | - |
dc.subject.keywordAuthor | convolutional neural network | - |
dc.subject.keywordAuthor | convolutional weight mapping | - |
dc.subject.keywordAuthor | Kernel | - |
dc.subject.keywordAuthor | kernel shape control (KERNTROL) | - |
dc.subject.keywordAuthor | Processing-in-memory | - |
dc.subject.keywordAuthor | Shape | - |
dc.subject.keywordAuthor | Shape control | - |
dc.subject.keywordAuthor | shift and duplicate (SDK) weight mapping | - |
dc.subject.keywordAuthor | Throughput | - |
dc.subject.keywordAuthor | Vectors | - |
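
The abstract describes KERNTROL-M as temporarily omitting weights in underutilized rows according to a utilization threshold. The sketch below is only a toy illustration of that general idea and is not the authors' implementation: the array layout, the function names `row_utilization` and `omit_underutilized_rows`, and the rule "skip a row when its fraction of nonzero cells falls below the threshold" are all assumptions made for this example. It does not model how SDK mapping places shifted and duplicated kernels.

```python
from typing import List, Tuple

Row = List[float]


def row_utilization(row: Row) -> float:
    """Fraction of cells in one array row that hold a nonzero weight."""
    return sum(1 for w in row if w != 0.0) / len(row)


def omit_underutilized_rows(
    array: List[Row], threshold: float
) -> Tuple[List[Row], List[int]]:
    """Return a masked copy of the array and the indices of skipped rows.

    Assumed rule (hypothetical): a row whose utilization is below
    `threshold` is zeroed in the copy so that it can be skipped.
    The input array is left untouched, mirroring the idea that the
    weights are omitted temporarily rather than pruned permanently.
    """
    masked: List[Row] = []
    skipped: List[int] = []
    for index, row in enumerate(array):
        if row_utilization(row) < threshold:
            skipped.append(index)
            masked.append([0.0] * len(row))
        else:
            masked.append(list(row))
    return masked, skipped


# Toy 4x4 array: rows are array rows, columns are mapped kernel columns.
weights = [
    [0.5, -0.2, 0.1, 0.3],  # utilization 1.00
    [0.0, 0.4, 0.0, 0.0],   # utilization 0.25
    [0.7, 0.0, 0.2, 0.9],   # utilization 0.75
    [0.0, 0.0, 0.0, 0.6],   # utilization 0.25
]

masked_weights, skipped_rows = omit_underutilized_rows(weights, threshold=0.5)
row_skipping_ratio = len(skipped_rows) / len(weights)
```

With a threshold of 0.5, rows 1 and 3 of this toy array fall below the threshold, so half of the rows are skipped while the original `weights` list remains intact.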