Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

GPTs Are Multilingual Annotators for Sequence Generation Tasks

Authors
Choi, JuhwanLee, EunjuJin, KyohoonKim, Youngbin
Issue Date
2024
Publisher
Association for Computational Linguistics (ACL)
Citation
EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2024, pp 17 - 40
Pages
24
Journal Title
EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2024
Start Page
17
End Page
40
URI
https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/73152
ISSN
0000-0000
Abstract
Data annotation is an essential step for constructing new datasets. However, the conventional approach of data annotation through crowdsourcing is both time-consuming and expensive. In addition, the complexity of this process increases when dealing with low-resource languages owing to the difference in the language pool of crowdworkers. To address these issues, this study proposes an autonomous annotation method by utilizing large language models, which have been recently demonstrated to exhibit remarkable performance. Through our experiments, we demonstrate that the proposed method is not just cost-efficient but also applicable for low-resource language annotation. Additionally, we constructed an image captioning dataset using our approach and are committed to open this dataset for future study. We have opened our source code for reproducibility. © 2024 Association for Computational Linguistics.
Files in This Item
Appears in
Collections
Graduate School of Advanced Imaging Sciences, Multimedia and Film > Department of Imaging Science and Arts > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Young Bin photo

Kim, Young Bin
첨단영상대학원 (영상학과)
Read more

Altmetrics

Total Views & Downloads

BROWSE