Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

RA-LoRA: Rank-Adaptive Parameter-Efficient Fine-Tuning for Accurate 2-bit Quantized Large Language Models

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Minsoo-
dc.contributor.authorLee, Sihwa-
dc.contributor.authorSung, Wonyong-
dc.contributor.authorChoi, Jungwook-
dc.date.accessioned2025-04-10T02:30:17Z-
dc.date.available2025-04-10T02:30:17Z-
dc.date.issued2024-08-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/207022-
dc.description.abstractDeploying large language models (LLMs) with their extensive parameters and high memory demands challenges computational efficiency, particularly in fine-tuning for specific applications with limited resources. Techniques like LowRank Adaptation (LoRA) help by training a smaller, modifiable extension of the base model to reduce memory usage. However, combining quantization with LoRA, especially in low-bit scenarios, can lead to performance losses due to quantization errors. Our innovative RankAdaptive LoRA (RA-LoRA) addresses this by dynamically adjusting the adapter's rank using rank-subspace analysis, optimizing performance with fewer parameters. We tested RALoRA on state-of-the-art LLMs for 2-bit efficient fine-tuning, showing it can improve model accuracy with minimal trainable parameters, marking a leap forward in quantization-aware fine-tuning methods and highlighting the significance of rank dynamics in optimizing quantized LLMs.-
dc.format.extent14-
dc.language영어-
dc.language.isoENG-
dc.publisherASSOC COMPUTATIONAL LINGUISTICS-ACL-
dc.titleRA-LoRA: Rank-Adaptive Parameter-Efficient Fine-Tuning for Accurate 2-bit Quantized Large Language Models-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.18653/v1/2024.findings-acl.933-
dc.identifier.scopusid2-s2.0-85205315830-
dc.identifier.wosid001391786807030-
dc.identifier.bibliographicCitationFINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, pp 15773 - 15786-
dc.citation.titleFINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024-
dc.citation.startPage15773-
dc.citation.endPage15786-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Interdisciplinary Applications-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.subject.keywordPlusProblem oriented languages-
dc.identifier.urlhttps://aclanthology.org/2024.findings-acl.933/-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Jung wook photo

Choi, Jung wook
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE