Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Architecture-Aware Optimization of Layer Fusion for Latency-Optimal CNN Inference

Full metadata record
DC Field Value Language
dc.contributor.authorYoon, Minyong-
dc.contributor.authorChoi, Jungwook-
dc.date.accessioned2023-08-22T02:58:33Z-
dc.date.available2023-08-22T02:58:33Z-
dc.date.created2023-08-17-
dc.date.issued2023-06-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/189408-
dc.description.abstractLayer fusion is an effective technique for accelerating latency-sensitive CNN inference tasks on resource-constrained accelerators that exploit distributed on-chip integrated memory-accelerator processing-in memory (PIM). However, previous research primarily focused on optimizing memory access, neglecting the significant impact of hardware architecture on latency. This study presents an analytical latency model for a 2D systolic array accelerator, taking into account various hardware factors such as array dimensions, buffer size, and bandwidth. We then investigate the influence of hardware architecture and fusion strategies, including weight and overlap reuse, on performance; these aspects are insufficiently addressed in existing access-based fusion models. By incorporating layer fusion with our proposed latency model across different architectures, dataflows, and workloads, we achieve up to a 53.1% reduction in end-to-end network latency compared to an access-based model.-
dc.language영어-
dc.language.isoen-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleArchitecture-Aware Optimization of Layer Fusion for Latency-Optimal CNN Inference-
dc.typeArticle-
dc.contributor.affiliatedAuthorChoi, Jungwook-
dc.identifier.doi10.1109/AICAS57966.2023.10168659-
dc.identifier.scopusid2-s2.0-85166380794-
dc.identifier.bibliographicCitationAICAS 2023 - IEEE International Conference on Artificial Intelligence Circuits and Systems, Proceeding, pp.1 - 4-
dc.relation.isPartOfAICAS 2023 - IEEE International Conference on Artificial Intelligence Circuits and Systems, Proceeding-
dc.citation.titleAICAS 2023 - IEEE International Conference on Artificial Intelligence Circuits and Systems, Proceeding-
dc.citation.startPage1-
dc.citation.endPage4-
dc.type.rimsART-
dc.type.docTypeConference paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordPlusConvolutional neural networks-
dc.subject.keywordPlusData flow analysis-
dc.subject.keywordPlusMemory architecture-
dc.subject.keywordPlusNetwork architecture-
dc.subject.keywordPlusSystolic arrays-
dc.subject.keywordPlusAnalytic cost model-
dc.subject.keywordPlusConvolutional neural network-
dc.subject.keywordPlusCost models-
dc.subject.keywordPlusDataflow optimization-
dc.subject.keywordPlusHardware architecture-
dc.subject.keywordPlusLatency model-
dc.subject.keywordPlusLayer fusion-
dc.subject.keywordPlusOn chips-
dc.subject.keywordPlusOptimisations-
dc.subject.keywordPlusProcessing-in-memory-
dc.subject.keywordAuthoranalytic cost model-
dc.subject.keywordAuthorconvolutional neural network-
dc.subject.keywordAuthordataflow optimization-
dc.subject.keywordAuthorLayer fusion-
dc.subject.keywordAuthorsystolic array-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10168659-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Jung wook photo

Choi, Jung wook
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE