A Weight-Sharing Autoencoder with Dynamic Quantization for Efficient Feature Compression

Full metadata record
DC Field | Value | Language
dc.contributor.author | Choi, J.S. [Choi, J.S.] | -
dc.contributor.author | Kim, J. [Kim, J.] | -
dc.contributor.author | Ko, J.H. [Ko, J.H.] | -
dc.date.accessioned | 2022-02-24T04:41:11Z | -
dc.date.available | 2022-02-24T04:41:11Z | -
dc.date.created | 2022-02-24 | -
dc.date.issued | 2021 | -
dc.identifier.issn | 2162-1233 | -
dc.identifier.uri | https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/95343 | -
dc.description.abstract | Collaborative inference (CI) enhances the inference efficiency of deep neural networks (DNNs) by partitioning a computational workload between an edge device and a cloud platform. Efficient inference using CI requires searching for the optimal partition layer that minimizes the end-to-end inference latency. In addition, the intermediate feature at the partitioned layer should be effectively compressed. However, recent DNN-based feature compression methods require independent models dedicated to each partition point, resulting in significant storage overhead. In this paper, we propose a novel method that efficiently compresses the features from variable partition layers using a single autoencoder. The proposed method incorporates a weight-sharing technique that shares the weights of the autoencoders that compress each partition layer. In addition, dynamic-bitwidth quantization is supported for flexibility in compression ratio. The experimental results show that the proposed method reduced the required parameter size by 4× compared to the existing independent-model-based method, while maintaining the accuracy loss within 0.5%. © 2021 IEEE. | -
dc.language | English | -
dc.language.iso | en | -
dc.publisher | IEEE Computer Society | -
dc.title | A Weight-Sharing Autoencoder with Dynamic Quantization for Efficient Feature Compression | -
dc.type | Article | -
dc.contributor.affiliatedAuthor | Choi, J.S. [Choi, J.S.] | -
dc.contributor.affiliatedAuthor | Kim, J. [Kim, J.] | -
dc.contributor.affiliatedAuthor | Ko, J.H. [Ko, J.H.] | -
dc.identifier.doi | 10.1109/ICTC52510.2021.9620912 | -
dc.identifier.scopusid | 2-s2.0-85122921654 | -
dc.identifier.wosid | 000790235800265 | -
dc.identifier.bibliographicCitation | International Conference on ICT Convergence, v.2021-October, pp.1111 - 1113 | -
dc.relation.isPartOf | International Conference on ICT Convergence | -
dc.citation.title | International Conference on ICT Convergence | -
dc.citation.volume | 2021-October | -
dc.citation.startPage | 1111 | -
dc.citation.endPage | 1113 | -
dc.type.rims | ART | -
dc.type.docType | Proceedings Paper | -
dc.description.journalClass | 1 | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | scopus | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | -
dc.subject.keywordAuthor | Autoencoder | -
dc.subject.keywordAuthor | Collaborative Inference | -
dc.subject.keywordAuthor | Dynamic Quantization | -
dc.subject.keywordAuthor | Feature Compression | -
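
Illustrative note on the abstract: the record does not describe how the paper's quantizer works, so the following is only a minimal sketch of what quantizing a feature vector at a selectable bitwidth could look like. It assumes a plain min-max uniform quantizer, which may differ from the authors' scheme; the function names `quantize`, `dequantize`, and `max_error` are hypothetical and do not come from the paper.

```python
def quantize(features, bits):
    """Uniformly quantize a list of floats to integer codes in [0, 2**bits - 1].

    Assumption: simple min-max scaling; not taken from the paper.
    """
    if bits < 1:
        raise ValueError("bits must be >= 1")
    lo, hi = min(features), max(features)
    levels = (1 << bits) - 1
    # Guard against a constant feature vector (zero range).
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((x - lo) / scale) for x in features]
    return codes, lo, scale


def dequantize(codes, lo, scale):
    """Map integer codes back to approximate float values."""
    return [lo + c * scale for c in codes]


def max_error(features, bits):
    """Largest absolute reconstruction error at the given bitwidth."""
    codes, lo, scale = quantize(features, bits)
    restored = dequantize(codes, lo, scale)
    return max(abs(a - b) for a, b in zip(features, restored))
```

Under this assumption, choosing a smaller `bits` value at run time shrinks the transmitted feature (a higher compression ratio) at the cost of a larger reconstruction error, which is the trade-off the abstract refers to as flexibility in compression ratio.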
Files in This Item
There are no files associated with this item.
Appears in Collections
Information and Communication Engineering > Department of Semiconductor Systems Engineering > 1. Journal Articles
Information and Communication Engineering > School of Electronic and Electrical Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.