Searching Optimal Floating-Point Format for Sub-8-Bit Large Language Model Inference

Hwang, Youngdeok; Lee, Janghwan; Park, Jiwoong; Lim, Jieun; Choi, Jungwook

doi:10.1109/ICEIC61013.2024.10457111

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Searching Optimal Floating-Point Format for Sub-8-Bit Large Language Model Inference

Full metadata record

DC Field	Value	Language
dc.contributor.author	Hwang, Youngdeok	-
dc.contributor.author	Lee, Janghwan	-
dc.contributor.author	Park, Jiwoong	-
dc.contributor.author	Lim, Jieun	-
dc.contributor.author	Choi, Jungwook	-
dc.date.accessioned	2024-11-28T14:31:32Z	-
dc.date.available	2024-11-28T14:31:32Z	-
dc.date.issued	2024-01	-
dc.identifier.issn	2574-1403	-
dc.identifier.issn	2767-7699	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196964	-
dc.description.abstract	Large Language Models (LLMs) have shown remarkable success in various natural language processing tasks. However, their extensive parameter count leads to significant memory and computational demands. To tackle these challenges, there is growing interest in employing post-training quantization (PTQ) with reduced-precision floating-point (FP) operations. Yet, the optimal FP configuration remains a topic of debate. Existing studies often overlook a thorough analysis of the diverse data distributions found in LLMs and the crucial design choice, denormal. In this paper, we conduct a comprehensive examination of the various data distributions within LLMs and the significance of denormal representation, presenting a mixed-format floating-point framework. Our proposed framework allows for sub-8-bit inference with minimal performance degradation in language modeling and reasoning tasks across a broad spectrum of LLMs.	-
dc.format.extent	4	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Searching Optimal Floating-Point Format for Sub-8-Bit Large Language Model Inference	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/ICEIC61013.2024.10457111	-
dc.identifier.scopusid	2-s2.0-85189243662	-
dc.identifier.bibliographicCitation	2024 International Conference on Electronics, Information, and Communication, ICEIC 2024, pp 1 - 4	-
dc.citation.title	2024 International Conference on Electronics, Information, and Communication, ICEIC 2024	-
dc.citation.startPage	1	-
dc.citation.endPage	4	-
dc.type.docType	Conference paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordAuthor	floating-point	-
dc.subject.keywordAuthor	Large language model	-
dc.subject.keywordAuthor	mixed-format	-
dc.subject.keywordAuthor	post-training quantization	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10457111	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Choi, Jung wook photo

Choi, Jung wook: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE