Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Non-Zero Grid for Accurate 2-Bit Additive Power-of-Two CNN Quantization

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Young Min-
dc.contributor.authorHan, Kyunghyun-
dc.contributor.authorLee, Wai-Kong-
dc.contributor.authorChang, Hyung Jin-
dc.contributor.authorHwang, Seong Oun-
dc.date.accessioned2023-05-16T08:41:53Z-
dc.date.available2023-05-16T08:41:53Z-
dc.date.created2023-05-15-
dc.date.issued2023-03-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/87771-
dc.description.abstractQuantization is an effective technique to reduce the memory and computational complexity of CNNs. Recent advances utilize additive powers-of-two to perform non-uniform quantization, which resembles a normal distribution and shows better performance than uniform quantization. With powers-of-two quantization, the computational complexity is also largely reduced because the slow multiplication operations are replaced with lightweight shift operations. However, there are serious problems in the previously proposed grid formulation for 2-bit quantization. In particular, these powers-of-two schemes produce zero values, generating significant training error and causing low accuracy. In addition, due to improper grid formulation, they also fallback to uniform quantization when the quantization level reaches 2-bit. Due to these reasons, on large CNN like ResNet-110, these powers-of-two schemes may not even train properly. To resolve these issues, we propose a new non-zero grid formulation that enables 2-bit non-uniform quantization and allow the CNN to be trained successfully in every attempt, even for a large network. The proposed technique quantizes weight as power-of-two values and projects it close to the mean area through a simple constant product on the exponential part. This allows our quantization scheme to closely resemble a non-uniform quantization at 2-bit, enabling successful training at 2-bit quantization, which is not found in the previous work. The proposed technique achieves 70.57% accuracy on the CIFAR-100 dataset trained with ResNet-110. This result is 6.24% higher than the additive powers-of-two scheme which only achieves 64.33% accuracy. Beside achieving higher accuracy, our work also maintains the same memory and computational efficiency with the original additive powers-of-two scheme.-
dc.language영어-
dc.language.isoen-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.relation.isPartOfIEEE ACCESS-
dc.titleNon-Zero Grid for Accurate 2-Bit Additive Power-of-Two CNN Quantization-
dc.typeArticle-
dc.type.rimsART-
dc.description.journalClass1-
dc.identifier.wosid000967563900001-
dc.identifier.doi10.1109/ACCESS.2023.3259959-
dc.identifier.bibliographicCitationIEEE ACCESS, v.11, pp.32051 - 32060-
dc.description.isOpenAccessY-
dc.identifier.scopusid2-s2.0-85151499830-
dc.citation.endPage32060-
dc.citation.startPage32051-
dc.citation.titleIEEE ACCESS-
dc.citation.volume11-
dc.contributor.affiliatedAuthorKim, Young Min-
dc.contributor.affiliatedAuthorLee, Wai-Kong-
dc.contributor.affiliatedAuthorHwang, Seong Oun-
dc.type.docTypeArticle-
dc.subject.keywordAuthorQuantization (signal)-
dc.subject.keywordAuthorDeep learning-
dc.subject.keywordAuthorConvolutional neural networks-
dc.subject.keywordAuthorGaussian distribution-
dc.subject.keywordAuthorMathematical models-
dc.subject.keywordAuthorInternet of Things-
dc.subject.keywordAuthorComputational modeling-
dc.subject.keywordAuthorQuantization-
dc.subject.keywordAuthordeep learning-
dc.subject.keywordAuthorconvolutional neural network-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
IT융합대학 > 컴퓨터공학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Hwang, Seong Oun photo

Hwang, Seong Oun
College of IT Convergence (컴퓨터공학부(컴퓨터공학전공))
Read more

Altmetrics

Total Views & Downloads

BROWSE