KERNTROL: Kernel Shape Control Toward Ultimate Memory Utilization for In-Memory Convolutional Weight Mapping

Authors
Rhe, Johnny; Jeon, Kang Eun; Lee, Joo Chan; Jeong, Seongmoon; Ko, Jong Hwan
Issue Date
22-Feb-2024
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
Computer architecture; Convolution; convolutional neural network; convolutional weight mapping; Kernel; kernel shape control (KERNTROL); Processing-in-memory; Shape; Shape control; shift and duplicate (SDK) weight mapping; Throughput; Vectors
Citation
IEEE Transactions on Circuits and Systems I: Regular Papers, pp. 1-14
Pages
14
Indexed
SCIE
SCOPUS
Journal Title
IEEE Transactions on Circuits and Systems I: Regular Papers
Start Page
1
End Page
14
URI
https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/110278
DOI
10.1109/TCSI.2024.3365175
ISSN
1549-8328
1558-0806
Abstract
Processing-in-memory (PIM) architectures have been highlighted as one of the most viable options for faster and more power-efficient computation. Paired with a convolutional weight mapping scheme, PIM arrays can accelerate various deep convolutional neural networks (CNNs) and the applications that adopt them. Recently, the shift and duplicate kernel (SDK) convolutional weight mapping scheme was proposed, achieving up to 50% throughput improvement over the prior art. However, the traditional pattern-based pruning methods, which were adopted for row-skipping and computing cycle reduction, are not optimal for the latest SDK mapping due to the loss of structural regularity caused by the shifted and duplicated kernels. To address this challenge, we propose kernel shape control (KERNTROL), a method in which kernel shapes are controlled depending on their mapped columns, with the purpose of fostering a structural regularity that is favorable for achieving a high row-skipping ratio and model accuracy. Instead of permanently pruning the weights, KERNTROL with an empty mask (KERNTROL-M) temporarily omits them in underutilized rows using a utilization threshold, thereby preserving important weight elements. However, a significant portion of the memory cells is still underutilized where the threshold is not enforced. To overcome this, we extend KERNTROL-M into KERNTROL with compensatory weights (KERNTROL-C). By populating idle cells with compensatory weights, KERNTROL-C can offset the accuracy drop from weight omission. In comparison to pattern-based pruning approaches, KERNTROL-C achieves simultaneous improvements of up to 36.4% in the compression rate and 5% in model accuracy, with up to 100% array utilization.
Appears in Collections
Information and Communication Engineering > School of Electronic and Electrical Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
