A Weight-Sharing Autoencoder with Dynamic Quantization for Efficient Feature Compression
- Authors
- Choi, J.S.[Choi, J.S.]; Kim, J.[Kim, J.]; Ko, J.H.[Ko, J.H.]
- Issue Date
- 2021
- Publisher
- IEEE Computer Society
- Keywords
- Autoencoder; Collaborative Inference; Dynamic Quantization; Feature Compression
- Citation
- International Conference on ICT Convergence, v.2021-October, pp.1111 - 1113
- Indexed
- SCOPUS
- Journal Title
- International Conference on ICT Convergence
- Volume
- 2021-October
- Start Page
- 1111
- End Page
- 1113
- URI
- https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/95343
- DOI
- 10.1109/ICTC52510.2021.9620912
- ISSN
- 2162-1233
- Abstract
- Collaborative inference (CI) enhances the inference efficiency of deep neural networks (DNNs) by partitioning a computational workload between an edge device and a cloud platform. Efficient inference using CI requires searching for the optimal partition layer that minimizes the end-to-end inference latency. In addition, the intermediate feature at the partitioned layer should be effectively compressed. However, recent DNN-based feature compression methods require independent models dedicated for each partition point, resulting in significant storage overhead. In this paper, we propose a novel method that efficiently compresses the features from variable partition layers using a single autoencoder. The proposed method incorporates a weight-sharing technique that shares the weights of autoencoders that compress each partition layer. In addition, dynamic bitwidths quantization is supported for flexibility in compression ratio. The experimental results show that the proposed method reduced the required parameter size by 4× compared to the existing independent model based method, while maintaining the accuracy loss within 0.5%. © 2021 IEEE.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - Information and Communication Engineering > School of Electronic and Electrical Engineering > 1. Journal Articles
- Information and Communication Engineering > Department of Semiconductor Systems Engineering > 1. Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/95343)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.