포스트에디팅 결과물의 정확성 오류 고찰 —AI 학습용 금융/증시 분야 한-영 번역 말뭉치를 대상으로—Accuracy errors in post-edited output, based on Korean-English parallel corpus for AI training
- Authors
- 김자경
- Issue Date
- Nov-2021
- Publisher
- 한국통역번역학회
- Keywords
- post-editing; quality of post-edited output; parallel translation corpus for AI training; post-editing education; post-editing guidelines
- Citation
- 통역과 번역, v.23, no.3, pp 29 - 57
- Pages
- 29
- Journal Title
- 통역과 번역
- Volume
- 23
- Number
- 3
- Start Page
- 29
- End Page
- 57
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/62069
- DOI
- 10.20305/it202103029058
- ISSN
- 1229-6074
- Abstract
- In sharp contrast to great attention to the quality of Machine Translation (MT) raw output, the quality of post-edited output has drawn relatively little attention in Korean translation studies, although some errors in MT output can remain even after post-editing. Against this backdrop, this study sets out to investigate accuracy errors in post-edited output, based on Korean-English parallel translation corpus for AI training released in June 2021 by the National Information Society Agency. For this purpose, 200 parallel sentences with accuracy errors were collected and classified by error type. According to the analysis results, mistranslation errors account for about two-thirds, with the rest in omissions, indicating that quite a number of omissions are still left in post-edited output. While lexical errors ranging from words to clauses are found most frequently in mistranslations, syntax errors represent a surprisingly large portion, with many errors in modifiers and subjects. This study draws attention to quality in MT post-editing, suggesting the need for further investigation into factors affecting the quality of post-edited output.
- Files in This Item
-
- Appears in
Collections - Graduate School of International Studies > Advanced Interpretation & Translation Program > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.