Detailed Information

Cited 5 times in Web of Science; cited 6 times in Scopus

DoubleQExt: Hardware and Memory Efficient CNN Through Two Levels of Quantization

Authors
See, Jin-Chuan; Ng, Hui-Fuang; Tan, Hung-Khoon; Chang, Jing-Jing; Lee, Wai-Kong; Hwang, Seong Oun
Issue Date
Dec-2021
Publisher
Institute of Electrical and Electronics Engineers (IEEE)
Keywords
Quantization (signal); Convolutional neural networks; Hardware; Memory management; Field programmable gate arrays; Internet of Things; Degradation; Deep learning
Citation
IEEE ACCESS, v.9, pp.169082-169091
Journal Title
IEEE ACCESS
Volume
9
Start Page
169082
End Page
169091
URI
https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/83190
DOI
10.1109/ACCESS.2021.3138756
ISSN
2169-3536
Abstract
To fulfil the tight area and memory constraints of IoT applications, the design of efficient Convolutional Neural Network (CNN) hardware becomes crucial. Quantization is one of the promising approaches that allows a large CNN to be compressed into a much smaller one, which is well suited to IoT applications. Among the various proposed quantization schemes, power-of-two (PoT) quantization enables efficient hardware implementation and small memory consumption for CNN accelerators, but requires retraining of the CNN to retain its accuracy. This paper proposes a two-level post-training static quantization technique (DoubleQ) that combines 8-bit and PoT weight quantization. The CNN weights are first quantized to 8-bit (level one), then further quantized to PoT (level two). By expressing the weights in their PoT exponent form, multiplication can be carried out using shifters. DoubleQ also reduces the memory storage requirement of the CNN, as only the exponents of the weights need to be stored. However, DoubleQ trades network accuracy for the reduced memory storage. To recover the accuracy, a selection process (DoubleQExt) is proposed that strategically selects some of the less informative layers in the network to be quantized with PoT at the second level. On ResNet-20, the proposed DoubleQ reduces memory consumption by 37.50% with 7.28% accuracy degradation compared to 8-bit quantization. By applying DoubleQExt, accuracy is degraded by only 1.19% compared to the 8-bit version while achieving a memory reduction of 23.05%. This result is also 1% more accurate than the state-of-the-art work SegLog. The proposed DoubleQExt also allows flexible configuration to trade memory consumption against accuracy, which is not found in other state-of-the-art works. With the proposed two-level weight quantization, one can achieve a more efficient hardware architecture for CNN with minimal impact on accuracy, which is crucial for IoT applications.
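
The following NumPy sketch illustrates the two-level idea described in the abstract. It is only an illustration under stated assumptions: the function names are ours, and the symmetric per-tensor 8-bit quantizer and the nearest-power-of-two rounding in the log domain are plausible choices, not necessarily the paper's exact scheme.

import numpy as np

def quantize_8bit(w):
    # Level one: symmetric 8-bit quantization (assumes w has a nonzero entry).
    scale = np.max(np.abs(w)) / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def quantize_pot(q):
    # Level two: snap each nonzero 8-bit weight to a nearby power of two
    # (rounding in the log domain), keeping only a sign and a 3-bit
    # exponent (0..7) per weight.
    sign = np.sign(q).astype(np.int8)
    mag = np.abs(q).astype(np.float64)
    exp = np.zeros(q.shape, dtype=np.int8)
    nz = mag > 0
    exp[nz] = np.clip(np.round(np.log2(mag[nz])), 0, 7).astype(np.int8)
    return sign, exp

def pot_multiply(x_int, sign, exp):
    # With PoT weights, multiplying an integer activation by a weight is
    # just a left shift plus a sign flip; no hardware multiplier is needed.
    return sign.astype(np.int32) * (x_int.astype(np.int32) << exp.astype(np.int32))

w = np.random.randn(4, 4).astype(np.float32)
q, scale = quantize_8bit(w)
sign, exp = quantize_pot(q)
w_hat = sign * (2.0 ** exp) * scale  # dequantized approximation of w

Storing only a sign and a 3-bit exponent per weight, rather than a full 8-bit value, is what yields the memory reduction reported above, and pot_multiply shows why a shifter can replace the multiplier in hardware.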
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of IT Convergence > Department of Computer Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Hwang, Seong Oun
College of IT Convergence (School of Computer Engineering, Computer Engineering Major)
