Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Prompt-Augmented Linear Probing: Scaling beyond the Limit of Few-Shot In-Context Learners

Full metadata record
DC Field Value Language
dc.contributor.authorCho, Hyunsoo-
dc.contributor.authorKim, Hyuhng Joon-
dc.contributor.authorKim, Junyeob-
dc.contributor.authorLee, Sang-Woo-
dc.contributor.authorLee, Sang-goo-
dc.contributor.authorYoo, Kang Min-
dc.contributor.authorKim, Tae Uk-
dc.date.accessioned2023-09-11T01:53:54Z-
dc.date.available2023-09-11T01:53:54Z-
dc.date.created2023-07-20-
dc.date.issued2023-06-
dc.identifier.issn2159-5399-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/190393-
dc.description.abstractThrough in-context learning (ICL), large-scale language models are effective few-shot learners without additional model fine-tuning. However, the ICL performance does not scale well with the number of available training samples as it is limited by the inherent input length constraint of the underlying language model. Meanwhile, many studies have revealed that language models are also powerful feature extractors, allowing them to be utilized in a black-box manner and enabling the linear probing paradigm, where lightweight discriminators are trained on top of the pre-extracted input representations. This paper proposes prompt-augmented linear probing (PALP), a hybrid of linear probing and ICL, which leverages the best of both worlds. PALP inherits the scalability of linear probing and the capability of enforcing language models to derive more meaningful representations via tailoring input into a more conceivable form. Throughout in-depth investigations on various datasets, we verified that PALP significantly enhances the input representations closing the gap between ICL in the data-hungry scenario and fine-tuning in the data-abundant scenario with little training overhead, potentially making PALP a strong alternative in a black-box scenario.-
dc.language영어-
dc.language.isoen-
dc.publisherAssociation for the Advancement of Artificial Intelligence-
dc.titlePrompt-Augmented Linear Probing: Scaling beyond the Limit of Few-Shot In-Context Learners-
dc.typeArticle-
dc.contributor.affiliatedAuthorKim, Tae Uk-
dc.identifier.doi10.1609/aaai.v37i11.26495-
dc.identifier.scopusid2-s2.0-85164138550-
dc.identifier.bibliographicCitationAAAI Conference on Artificial Intelligence, v.37, no.11, pp.12709 - 12716-
dc.relation.isPartOfAAAI Conference on Artificial Intelligence-
dc.citation.titleAAAI Conference on Artificial Intelligence-
dc.citation.volume37-
dc.citation.number11-
dc.citation.startPage12709-
dc.citation.endPage12716-
dc.type.rimsART-
dc.type.docTypeProceeding-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordPlusBlack boxes-
dc.subject.keywordPlusContext learning-
dc.subject.keywordPlusFine tuning-
dc.subject.keywordPlusIn contexts-
dc.subject.keywordPlusLanguage model-
dc.subject.keywordPlusLarge-scales-
dc.subject.keywordPlusLearning performance-
dc.subject.keywordPlusLinear probing-
dc.subject.keywordPlusScalings-
dc.subject.keywordPlusTraining sample-
dc.subject.keywordAuthorSNLP-
dc.subject.keywordAuthorLanguage Models, SNLP-
dc.subject.keywordAuthorText Classification-
dc.identifier.urlhttps://ojs.aaai.org/index.php/AAAI/article/view/26495-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Taeuk photo

Kim, Taeuk
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE