Cited 0 time in
Accurate and High-Throughput Analog-Digital DNN Acceleration using Sub-Network Scheduling
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Jintae | - |
| dc.contributor.author | Jeong, Byoungjun | - |
| dc.contributor.author | Kim, Changdae | - |
| dc.contributor.author | Ryu, Narae | - |
| dc.contributor.author | Pak, Eunji | - |
| dc.contributor.author | Lee, Hunjun | - |
| dc.date.accessioned | 2026-06-22T06:30:14Z | - |
| dc.date.available | 2026-06-22T06:30:14Z | - |
| dc.date.issued | 2026-01 | - |
| dc.identifier.issn | 1556-6056 | - |
| dc.identifier.issn | 1556-6064 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/214007 | - |
| dc.description.abstract | Analog Computing-in-Memory (ACiM) devices perform matrix operations directly within memory arrays, offering high throughput for deep neural network inference. At the same time, they are susceptible to various noise sources, which reduces the computational accuracy. This inherent trade-off limits the adoption of ACiM devices as a stand-alone accelerator. In this paper, we propose the first heterogeneous inference-serving framework that coordinates ACiM hardware and conventional digital processors during inference. The system dynamically partitions each neural network into an analog sub-network executed on high-throughput ACiM devices and a digital sub-network handled by precise digital devices. Then, it integrates a transition layer and a custom training strategy to maintain accuracy across varying analog-digital partitions. Also, it adopts a two-level scheduler to adjust the partition ratio at runtime in response to varying query demands. We evaluate our system using five network models to demonstrate the benefits of our system. | - |
| dc.format.extent | 4 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | IEEE COMPUTER SOC | - |
| dc.title | Accurate and High-Throughput Analog-Digital DNN Acceleration using Sub-Network Scheduling | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/LCA.2026.3668766 | - |
| dc.identifier.scopusid | 2-s2.0-105031560609 | - |
| dc.identifier.wosid | 001719505100004 | - |
| dc.identifier.bibliographicCitation | IEEE COMPUTER ARCHITECTURE LETTERS, v.25, no.1, pp 97 - 100 | - |
| dc.citation.title | IEEE COMPUTER ARCHITECTURE LETTERS | - |
| dc.citation.volume | 25 | - |
| dc.citation.number | 1 | - |
| dc.citation.startPage | 97 | - |
| dc.citation.endPage | 100 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Hardware & Architecture | - |
| dc.subject.keywordPlus | Analog computers | - |
| dc.subject.keywordPlus | Analog to digital conversion | - |
| dc.subject.keywordPlus | Deep neural networks | - |
| dc.subject.keywordPlus | Digital devices | - |
| dc.subject.keywordPlus | Memory architecture | - |
| dc.subject.keywordPlus | Throughput | - |
| dc.subject.keywordAuthor | Accuracy | - |
| dc.subject.keywordAuthor | Throughput | - |
| dc.subject.keywordAuthor | Computational modeling | - |
| dc.subject.keywordAuthor | Load modeling | - |
| dc.subject.keywordAuthor | Analog-digital conversion | - |
| dc.subject.keywordAuthor | Personal digital devices | - |
| dc.subject.keywordAuthor | Runtime | - |
| dc.subject.keywordAuthor | Noise | - |
| dc.subject.keywordAuthor | Hardware | - |
| dc.subject.keywordAuthor | Processor scheduling | - |
| dc.subject.keywordAuthor | Compute-in-memory | - |
| dc.subject.keywordAuthor | hybrid analog-digital system | - |
| dc.subject.keywordAuthor | accuracy scaling | - |
| dc.subject.keywordAuthor | sub-network scheduling | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/11415650 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
