Searching Optimal Floating-Point Format for Sub-8-Bit Large Language Model Inference
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Hwang, Youngdeok | - |
| dc.contributor.author | Lee, Janghwan | - |
| dc.contributor.author | Park, Jiwoong | - |
| dc.contributor.author | Lim, Jieun | - |
| dc.contributor.author | Choi, Jungwook | - |
| dc.date.accessioned | 2024-11-28T14:31:32Z | - |
| dc.date.available | 2024-11-28T14:31:32Z | - |
| dc.date.issued | 2024-01 | - |
| dc.identifier.issn | 2574-1403 | - |
| dc.identifier.issn | 2767-7699 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196964 | - |
| dc.description.abstract | Large Language Models (LLMs) have shown remarkable success in various natural language processing tasks. However, their extensive parameter count leads to significant memory and computational demands. To tackle these challenges, there is growing interest in employing post-training quantization (PTQ) with reduced-precision floating-point (FP) operations. Yet, the optimal FP configuration remains a topic of debate. Existing studies often overlook a thorough analysis of the diverse data distributions found in LLMs and of a crucial design choice: denormal representation. In this paper, we conduct a comprehensive examination of the various data distributions within LLMs and the significance of denormal representation, presenting a mixed-format floating-point framework. Our proposed framework allows for sub-8-bit inference with minimal performance degradation in language modeling and reasoning tasks across a broad spectrum of LLMs. | - |
| dc.format.extent | 4 | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | Searching Optimal Floating-Point Format for Sub-8-Bit Large Language Model Inference | - |
| dc.type | Article | - |
| dc.publisher.location | United States | - |
| dc.identifier.doi | 10.1109/ICEIC61013.2024.10457111 | - |
| dc.identifier.scopusid | 2-s2.0-85189243662 | - |
| dc.identifier.bibliographicCitation | 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024, pp 1 - 4 | - |
| dc.citation.title | 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 4 | - |
| dc.type.docType | Conference paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.subject.keywordAuthor | floating-point | - |
| dc.subject.keywordAuthor | Large language model | - |
| dc.subject.keywordAuthor | mixed-format | - |
| dc.subject.keywordAuthor | post-training quantization | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10457111 | - |
