RA-LoRA: Rank-Adaptive Parameter-Efficient Fine-Tuning for Accurate 2-bit Quantized Large Language Models

Kim, Minsoo; Lee, Sihwa; Sung, Wonyong; Choi, Jungwook

doi:10.18653/v1/2024.findings-acl.933

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

RA-LoRA: Rank-Adaptive Parameter-Efficient Fine-Tuning for Accurate 2-bit Quantized Large Language Models

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Minsoo	-
dc.contributor.author	Lee, Sihwa	-
dc.contributor.author	Sung, Wonyong	-
dc.contributor.author	Choi, Jungwook	-
dc.date.accessioned	2025-04-10T02:30:17Z	-
dc.date.available	2025-04-10T02:30:17Z	-
dc.date.issued	2024-08	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207022	-
dc.description.abstract	Deploying large language models (LLMs) with their extensive parameters and high memory demands challenges computational efficiency, particularly in fine-tuning for specific applications with limited resources. Techniques like LowRank Adaptation (LoRA) help by training a smaller, modifiable extension of the base model to reduce memory usage. However, combining quantization with LoRA, especially in low-bit scenarios, can lead to performance losses due to quantization errors. Our innovative RankAdaptive LoRA (RA-LoRA) addresses this by dynamically adjusting the adapter's rank using rank-subspace analysis, optimizing performance with fewer parameters. We tested RALoRA on state-of-the-art LLMs for 2-bit efficient fine-tuning, showing it can improve model accuracy with minimal trainable parameters, marking a leap forward in quantization-aware fine-tuning methods and highlighting the significance of rank dynamics in optimizing quantized LLMs.	-
dc.format.extent	14	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	ASSOC COMPUTATIONAL LINGUISTICS-ACL	-
dc.title	RA-LoRA: Rank-Adaptive Parameter-Efficient Fine-Tuning for Accurate 2-bit Quantized Large Language Models	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.18653/v1/2024.findings-acl.933	-
dc.identifier.scopusid	2-s2.0-85205315830	-
dc.identifier.wosid	001391786807030	-
dc.identifier.bibliographicCitation	FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, pp 15773 - 15786	-
dc.citation.title	FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024	-
dc.citation.startPage	15773	-
dc.citation.endPage	15786	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Computer Science, Interdisciplinary Applications	-
dc.relation.journalWebOfScienceCategory	Computer Science, Theory & Methods	-
dc.subject.keywordPlus	Problem oriented languages	-
dc.identifier.url	https://aclanthology.org/2024.findings-acl.933/	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Choi, Jung wook photo

Choi, Jung wook: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE