Detailed Information

Cited 1 time in webofscience Cited 1 time in scopus
Metadata Downloads

Low Latency Implementations of CNN for Resource-Constrained IoT Devices

Full metadata record
DC Field Value Language
dc.contributor.authorMujtaba, Ahmed-
dc.contributor.authorLee, Wai-Kong-
dc.contributor.authorHwang, Seong Oun-
dc.date.accessioned2023-03-14T06:40:32Z-
dc.date.available2023-03-14T06:40:32Z-
dc.date.created2023-03-14-
dc.date.issued2022-12-
dc.identifier.issn1549-7747-
dc.identifier.urihttps://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/87089-
dc.description.abstractConvolutional Neural Network (CNN) inference on a resource-constrained Internet-of-Things (IoT) device (i.e., ARM Cortex-M microcontroller) requires careful optimization to reduce the timing overhead. We propose two novel techniques to improve the computational efficiency of CNNs by targeting low-cost microcontrollers. Our techniques utilize on-chip memory and minimize redundant operations, yielding low-latency inference results on complex quantized models such as MobileNetV1. On the ImageNet dataset for per-layer quantization, we reduce inference latency and Multiply-and-Accumulate (MAC) per cycle by 22.4% and 22.9%, respectively, compared to the state-of-theart mixed-precision CMix-NN library. On the CIFAR-10 dataset for per-channel quantization, we reduce inference latency and MAC per cycle by 31.7% and 31.3%, respectively. The achieved low-latency inference results can improve the user experience and save power budget in resource-constrained IoT devices.-
dc.language영어-
dc.language.isoen-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.relation.isPartOfIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS-
dc.titleLow Latency Implementations of CNN for Resource-Constrained IoT Devices-
dc.typeArticle-
dc.type.rimsART-
dc.description.journalClass1-
dc.identifier.wosid000922028300100-
dc.identifier.doi10.1109/TCSII.2022.3205029-
dc.identifier.bibliographicCitationIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, v.69, no.12, pp.5124 - 5128-
dc.description.isOpenAccessN-
dc.identifier.scopusid2-s2.0-85137943322-
dc.citation.endPage5128-
dc.citation.startPage5124-
dc.citation.titleIEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS-
dc.citation.volume69-
dc.citation.number12-
dc.contributor.affiliatedAuthorMujtaba, Ahmed-
dc.contributor.affiliatedAuthorLee, Wai-Kong-
dc.contributor.affiliatedAuthorHwang, Seong Oun-
dc.type.docTypeArticle-
dc.subject.keywordAuthorConvolutional neural networks-
dc.subject.keywordAuthorInternet-of-Things-
dc.subject.keywordAuthormicrocontrollers-
dc.subject.keywordAuthortiny ML-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
IT융합대학 > 컴퓨터공학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Hwang, Seong Oun photo

Hwang, Seong Oun
College of IT Convergence (컴퓨터공학부(컴퓨터공학전공))
Read more

Altmetrics

Total Views & Downloads

BROWSE