Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Is Prompt Transfer Always Effective? An Empirical Study of Prompt Transfer for Question Answering

Full metadata record
DC Field Value Language
dc.contributor.authorJung, Minji-
dc.contributor.authorPark, Soyeon-
dc.contributor.authorSul, Jeewoo-
dc.contributor.authorChoi, Yong Suk-
dc.date.accessioned2025-08-05T07:30:28Z-
dc.date.available2025-08-05T07:30:28Z-
dc.date.issued2024-06-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208407-
dc.description.abstractPrompt tuning, which freezes all parameters of a pre-trained model and only trains a soft prompt, has emerged as a parameter-efficient approach. For the reason that the prompt initialization becomes sensitive when the model size is small, the prompt transfer that uses the trained prompt as an initialization for the target task has recently been introduced. Since previous works have compared tasks in large categories (e.g., summarization, sentiment analysis), the factors that influence prompt transfer have not been sufficiently explored. In this paper, we characterize the question answering task based on features such as answer format and empirically investigate the transferability of soft prompts for the first time. We analyze the impact of initialization during prompt transfer and find that the train dataset size of source and target tasks have the influence significantly. Furthermore, we propose a novel approach for measuring catastrophic forgetting and investigate how it occurs in terms of the amount of evidence. Our findings can help deeply understand transfer learning in prompt tuning(1).-
dc.format.extent12-
dc.language영어-
dc.language.isoENG-
dc.publisherASSOC COMPUTATIONAL LINGUISTICS-ACL-
dc.titleIs Prompt Transfer Always Effective? An Empirical Study of Prompt Transfer for Question Answering-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.18653/v1/2024.naacl-short.44-
dc.identifier.scopusid2-s2.0-85199527690-
dc.identifier.wosid001516394400044-
dc.identifier.bibliographicCitationPROCEEDINGS OF THE 2024 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, VOL 2: SHORT PAPERS, pp 528 - 539-
dc.citation.titlePROCEEDINGS OF THE 2024 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, VOL 2: SHORT PAPERS-
dc.citation.startPage528-
dc.citation.endPage539-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaLinguistics-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryLinguistics-
dc.subject.keywordPlusComputational linguistics-
dc.subject.keywordPlusTransfer learning-
dc.identifier.urlhttps://aclanthology.org/2024.naacl-short.44/-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Yong Suk photo

Choi, Yong Suk
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE