Detailed Information

Minimizing Global Buffer Access in a Deep Learning Accelerator Using a Local Register File with a Rearranged Computational Sequence

Full metadata record
DC Field | Value | Language
dc.contributor.author | Lee, Minjae | -
dc.contributor.author | Zhang, Zhongfeng | -
dc.contributor.author | Choi, Seungwon | -
dc.contributor.author | Choi, Jungwook | -
dc.date.accessioned | 2022-07-06T06:24:53Z | -
dc.date.available | 2022-07-06T06:24:53Z | -
dc.date.created | 2022-05-04 | -
dc.date.issued | 2022-04 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/138975 | -
dc.description.abstract | We propose a method for minimizing global buffer access within a deep learning accelerator for convolution operations by maximizing data reuse through a local register file, thereby replacing power-hungry global buffer accesses with local register file accesses. To fully exploit the merits of data reuse, this study proposes a rearrangement of the computational sequence in a deep learning accelerator. Once input data are read from the global buffer, repeated reads of the same data are served only by the local register file, which saves significant power. Furthermore, unlike prior works that equip each computation unit with its own local register file, the proposed method shares a local register file along each column of the 2D computation array, saving resources and controlling overhead. The proposed accelerator is implemented on an off-the-shelf field-programmable gate array to verify its functionality and resource utilization. The performance improvement of the proposed method is then demonstrated relative to popular deep learning accelerators. Our evaluation indicates that the proposed deep learning accelerator reduces the number of global-buffer accesses to nearly 86.8%, consequently saving up to 72.3% of the power consumption for the input data memory access with a minor increase in resource usage compared to a conventional deep learning accelerator. | -
dc.language | English | -
dc.language.iso | en | -
dc.publisher | MDPI | -
dc.title | Minimizing Global Buffer Access in a Deep Learning Accelerator Using a Local Register File with a Rearranged Computational Sequence | -
dc.type | Article | -
dc.contributor.affiliatedAuthor | Choi, Seungwon | -
dc.contributor.affiliatedAuthor | Choi, Jungwook | -
dc.identifier.doi | 10.3390/s22083095 | -
dc.identifier.scopusid | 2-s2.0-85128351043 | -
dc.identifier.wosid | 000786834000001 | -
dc.identifier.bibliographicCitation | SENSORS, v.22, no.8, pp.1 - 24 | -
dc.relation.isPartOf | SENSORS | -
dc.citation.title | SENSORS | -
dc.citation.volume | 22 | -
dc.citation.number | 8 | -
dc.citation.startPage | 1 | -
dc.citation.endPage | 24 | -
dc.type.rims | ART | -
dc.type.docType | Article | -
dc.description.journalClass | 1 | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Chemistry | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalResearchArea | Instruments & Instrumentation | -
dc.relation.journalWebOfScienceCategory | Chemistry, Analytical | -
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | -
dc.relation.journalWebOfScienceCategory | Instruments & Instrumentation | -
dc.subject.keywordPlus | NEURAL-NETWORKS | -
dc.subject.keywordPlus | HARDWARE ACCELERATOR | -
dc.subject.keywordPlus | DATA-FLOW | -
dc.subject.keywordPlus | CNN | -
dc.subject.keywordAuthor | deep learning accelerator | -
dc.subject.keywordAuthor | field-programmable gate array (FPGA) | -
dc.subject.keywordAuthor | local register file | -
dc.subject.keywordAuthor | rearrangement of computational sequence | -
dc.identifier.url | https://www.mdpi.com/1424-8220/22/8/3095 | -
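
The data-reuse idea summarized in the abstract can be illustrated with a toy access-count model. The sketch below is not the authors' accelerator or their computational sequence. It assumes a single-channel, stride-1 convolution without padding and an idealized local register file with unlimited capacity, and the name count_input_reads is hypothetical. It only shows why serving repeated input reads from a local store lowers the number of global buffer reads.

```python
def count_input_reads(height, width, kernel, use_register_file):
    """Count input reads for a stride-1, unpadded 2D convolution.

    Returns (global_reads, local_reads). The register file is modelled
    as an unbounded set, which is an idealizing assumption.
    """
    out_h = height - kernel + 1
    out_w = width - kernel + 1
    register_file = set()  # input coordinates already fetched once
    global_reads = 0
    local_reads = 0
    for oy in range(out_h):
        for ox in range(out_w):
            for ky in range(kernel):
                for kx in range(kernel):
                    pixel = (oy + ky, ox + kx)
                    if use_register_file and pixel in register_file:
                        # Repeat read: served locally, no global access.
                        local_reads += 1
                    else:
                        # First read (or no register file): global access.
                        global_reads += 1
                        if use_register_file:
                            register_file.add(pixel)
    return global_reads, local_reads


if __name__ == "__main__":
    baseline, _ = count_input_reads(8, 8, 3, use_register_file=False)
    reused, local = count_input_reads(8, 8, 3, use_register_file=True)
    print(f"global reads without reuse: {baseline}")
    print(f"global reads with reuse:    {reused} (plus {local} local reads)")
```

For an 8x8 input and a 3x3 kernel, this toy model counts 324 global reads without reuse and 64 with reuse. These numbers come from the idealized model only and say nothing about the figures reported in the paper, which depend on the real array organization and register file size.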
Appears in Collections
College of Engineering (Seoul) > Division of Electronic Engineering (Seoul) > 1. Journal Articles
