A Weight-Sharing Autoencoder with Dynamic Quantization for Efficient Feature Compression
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Choi, J.S.[Choi, J.S.] | - |
dc.contributor.author | Kim, J.[Kim, J.] | - |
dc.contributor.author | Ko, J.H.[Ko, J.H.] | - |
dc.date.accessioned | 2022-02-24T04:41:11Z | - |
dc.date.available | 2022-02-24T04:41:11Z | - |
dc.date.created | 2022-02-24 | - |
dc.date.issued | 2021 | - |
dc.identifier.issn | 2162-1233 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/95343 | - |
dc.description.abstract | Collaborative inference (CI) enhances the inference efficiency of deep neural networks (DNNs) by partitioning a computational workload between an edge device and a cloud platform. Efficient inference using CI requires searching for the optimal partition layer that minimizes the end-to-end inference latency. In addition, the intermediate feature at the partitioned layer should be effectively compressed. However, recent DNN-based feature compression methods require independent models dedicated for each partition point, resulting in significant storage overhead. In this paper, we propose a novel method that efficiently compresses the features from variable partition layers using a single autoencoder. The proposed method incorporates a weight-sharing technique that shares the weights of autoencoders that compress each partition layer. In addition, dynamic bitwidths quantization is supported for flexibility in compression ratio. The experimental results show that the proposed method reduced the required parameter size by 4× compared to the existing independent model based method, while maintaining the accuracy loss within 0.5%. © 2021 IEEE. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | IEEE Computer Society | - |
dc.title | A Weight-Sharing Autoencoder with Dynamic Quantization for Efficient Feature Compression | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Choi, J.S.[Choi, J.S.] | - |
dc.contributor.affiliatedAuthor | Kim, J.[Kim, J.] | - |
dc.contributor.affiliatedAuthor | Ko, J.H.[Ko, J.H.] | - |
dc.identifier.doi | 10.1109/ICTC52510.2021.9620912 | - |
dc.identifier.scopusid | 2-s2.0-85122921654 | - |
dc.identifier.wosid | 000790235800265 | - |
dc.identifier.bibliographicCitation | International Conference on ICT Convergence, v.2021-October, pp.1111 - 1113 | - |
dc.relation.isPartOf | International Conference on ICT Convergence | - |
dc.citation.title | International Conference on ICT Convergence | - |
dc.citation.volume | 2021-October | - |
dc.citation.startPage | 1111 | - |
dc.citation.endPage | 1113 | - |
dc.type.rims | ART | - |
dc.type.docType | Proceedings Paper | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.subject.keywordAuthor | Autoencoder | - |
dc.subject.keywordAuthor | Collaborative Inference | - |
dc.subject.keywordAuthor | Dynamic Quantization | - |
dc.subject.keywordAuthor | Feature Compression | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(03063) 25-2, SUNGKYUNKWAN-RO, JONGNO-GU, SEOUL, KOREAsamsunglib@skku.edu
COPYRIGHT © 2021 SUNGKYUNKWAN UNIVERSITY ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.