Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Finding Optimal Numerical Format for Sub-8-Bit Post-Training Quantization of Vision Transformers

Full metadata record
DC Field Value Language
dc.contributor.author이장환-
dc.contributor.authorHwang, Youngdeok-
dc.contributor.authorChoi, Jungwook-
dc.date.accessioned2024-11-28T10:31:42Z-
dc.date.available2024-11-28T10:31:42Z-
dc.date.issued2023-06-
dc.identifier.issn0736-7791-
dc.identifier.issn1520-6149-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196142-
dc.description.abstractVision Transformers (ViTs) have gained significant attention for their exceptional model accuracies on computer vision applications, but their demanding memory requirements and computational complexity have hindered active deployment. Post-training quantization (PTQ) is a practical method to tackle this challenge by directly reducing ViT's bit-precision. However, diverse data characteristics across different operations of ViT cannot be well captured solely by a single numerical format (fixed or floating-point). This work proposes an analytical framework that optimizes the numerical format of each matrix multiplication of ViTs for mixed-format sub-8bit quantization. The extensive evaluation demonstrates that the proposed method can reduce the PTQ error and achieve state-of-the-art accuracy for popular ViT models.-
dc.format.extent5-
dc.language영어-
dc.language.isoENG-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleFinding Optimal Numerical Format for Sub-8-Bit Post-Training Quantization of Vision Transformers-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/ICASSP49357.2023.10096798-
dc.identifier.scopusid2-s2.0-85180571986-
dc.identifier.bibliographicCitationICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1 - 5-
dc.citation.titleICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings-
dc.citation.startPage1-
dc.citation.endPage5-
dc.type.docTypeConference paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordAuthorfixed-point-
dc.subject.keywordAuthorfloating-point-
dc.subject.keywordAuthorPost-training quantization-
dc.subject.keywordAuthorvision Transformer-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/10096798-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Jung wook photo

Choi, Jung wook
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE