Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Efficient Federated Learning with Pre-Trained Large Language Model Using Several Adapter Mechanismsopen access

Authors
Kim, GyunyeopYoo, JoonKang, Sangwoo
Issue Date
Nov-2023
Publisher
MDPI
Keywords
federated learning; deep learning; transfer learning; adapter transformer
Citation
MATHEMATICS, v.11, no.21
Journal Title
MATHEMATICS
Volume
11
Number
21
URI
https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/89516
DOI
10.3390/math11214479
ISSN
2227-7390
2227-7390
Abstract
Recent advancements in deep learning have led to various challenges, one of which is the issue of data privacy in training data. To address this issue, federated learning, a technique that merges models trained by clients on servers, has emerged as an attractive solution. However, federated learning faces challenges related to data heterogeneity and system heterogeneity. Recent observations suggest that incorporating pre-trained models into federated learning can mitigate some of these challenges. Nonetheless, the main drawback of pre-trained models lies in their typically large model size, leading to excessive data transmission when clients send these models to the server. Additionally, federated learning involves multiple global steps, which means transmitting a large language model to multiple clients results in too much data exchange. In this paper, we propose a novel approach to address this challenge using adapters. Adapters demonstrate training efficiency by training a small capacity adapter layer alongside a large language model. This unique characteristic reduces the volume of data transmission, offering a practical solution to the problem. The evaluation results demonstrate that the proposed method achieves a reduction in training time of approximately 20-40% and a transmission speed improvement of over 98% compared to previous approaches.
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Yoo, Joon photo

Yoo, Joon
College of IT Convergence (Department of Software)
Read more

Altmetrics

Total Views & Downloads

BROWSE