Is Prompt Transfer Always Effective? An Empirical Study of Prompt Transfer for Question Answering

Jung, Minji; Park, Soyeon; Sul, Jeewoo; Choi, Yong Suk

doi:10.18653/v1/2024.naacl-short.44

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Is Prompt Transfer Always Effective? An Empirical Study of Prompt Transfer for Question Answering

Full metadata record

DC Field	Value	Language
dc.contributor.author	Jung, Minji	-
dc.contributor.author	Park, Soyeon	-
dc.contributor.author	Sul, Jeewoo	-
dc.contributor.author	Choi, Yong Suk	-
dc.date.accessioned	2025-08-05T07:30:28Z	-
dc.date.available	2025-08-05T07:30:28Z	-
dc.date.issued	2024-06	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208407	-
dc.description.abstract	Prompt tuning, which freezes all parameters of a pre-trained model and only trains a soft prompt, has emerged as a parameter-efficient approach. For the reason that the prompt initialization becomes sensitive when the model size is small, the prompt transfer that uses the trained prompt as an initialization for the target task has recently been introduced. Since previous works have compared tasks in large categories (e.g., summarization, sentiment analysis), the factors that influence prompt transfer have not been sufficiently explored. In this paper, we characterize the question answering task based on features such as answer format and empirically investigate the transferability of soft prompts for the first time. We analyze the impact of initialization during prompt transfer and find that the train dataset size of source and target tasks have the influence significantly. Furthermore, we propose a novel approach for measuring catastrophic forgetting and investigate how it occurs in terms of the amount of evidence. Our findings can help deeply understand transfer learning in prompt tuning(1).	-
dc.format.extent	12	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	ASSOC COMPUTATIONAL LINGUISTICS-ACL	-
dc.title	Is Prompt Transfer Always Effective? An Empirical Study of Prompt Transfer for Question Answering	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.18653/v1/2024.naacl-short.44	-
dc.identifier.scopusid	2-s2.0-85199527690	-
dc.identifier.wosid	001516394400044	-
dc.identifier.bibliographicCitation	PROCEEDINGS OF THE 2024 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, VOL 2: SHORT PAPERS, pp 528 - 539	-
dc.citation.title	PROCEEDINGS OF THE 2024 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, VOL 2: SHORT PAPERS	-
dc.citation.startPage	528	-
dc.citation.endPage	539	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Linguistics	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Linguistics	-
dc.subject.keywordPlus	Computational linguistics	-
dc.subject.keywordPlus	Transfer learning	-
dc.identifier.url	https://aclanthology.org/2024.naacl-short.44/	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Choi, Yong Suk photo

Choi, Yong Suk: COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE