AutoAugment Is What You Need: Enhancing Rule-based Augmentation Methods in Low-resource Regimes
- Authors
- Choi, Juhwan; Jin, Kyohoon; Lee, Junho; Song, Sangmin; Kim, Youngbin
- Issue Date
- 2024
- Publisher
- Association for Computational Linguistics (ACL)
- Citation
- EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Student Research Workshop, pp 1 - 8
- Pages
- 8
- Journal Title
- EACL 2024 - 18th Conference of the European Chapter of the Association for Computational Linguistics, Proceedings of the Student Research Workshop
- Start Page
- 1
- End Page
- 8
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/73160
- ISSN
- 0000-0000
- Abstract
- Text data augmentation is a complex problem due to the discrete nature of sentences. Although rule-based augmentation methods are widely adopted in real-world applications because of their simplicity, they suffer from potential semantic damage. Previous researchers have suggested easy data augmentation with soft labels (softEDA), employing label smoothing to mitigate this problem. However, finding the best factor for each model and dataset is challenging; therefore, using softEDA in real-world applications is still difficult. In this paper, we propose adapting AutoAugment to solve this problem. The experimental results suggest that the proposed method can boost existing augmentation methods and that rule-based methods can enhance cutting-edge pre-trained language models. We offer the source code. © 2024 Association for Computational Linguistics.
- Files in This Item
-
- Appears in
Collections - Graduate School of Advanced Imaging Sciences, Multimedia and Film > Department of Imaging Science and Arts > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.