Beyond Reference: Evaluating High Quality Translations Better than Human References

Authors
Noh, Keonwoong; Oh, Seokjin; Jung, Woohwan
Issue Date
Jun-2025
Publisher
ASSOC COMPUTATIONAL LINGUISTICS-ACL
Citation
2024 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2024, pp 5111 - 5127
Pages
17
Indexed
SCIE
Journal Title
2024 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING, EMNLP 2024
Start Page
5111
End Page
5127
URI
https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/126170
DOI
10.18653/v1/2024.emnlp-main.294
Abstract
In Machine Translation (MT) evaluation, the conventional approach is to compare a translated sentence against its human-created reference sentence. MT metrics assign an absolute score (e.g., from 0 to 1) to a candidate sentence based on its similarity to the reference sentence. Thus, existing MT metrics give the maximum score to the reference sentence itself. However, this approach overlooks the possibility that a candidate sentence exceeds the reference sentence in quality. Recent advances in Large Language Models (LLMs) have highlighted this issue, as LLM-generated sentences often exceed the quality of human-written sentences. To address the problem, we introduce the Residual score Metric (RESUME), which evaluates the quality of a candidate sentence relative to its reference sentence. RESUME assigns a positive score to candidate sentences that outperform their reference sentences, and a negative score when they fall short. By adding the residual scores from RESUME to the absolute scores from MT metrics, candidate sentences can receive higher scores than their reference sentences receive from those metrics. Experimental results demonstrate that RESUME improves the alignment between MT metrics and human judgments at both the segment level and the system level.
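
The score combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: RESUME itself is a learned metric, and the class name `ScoredCandidate`, the function `combined_score`, and all numeric values below are hypothetical placeholders chosen only to show how adding a signed residual to an absolute score lets a candidate rank above its reference.

```python
from dataclasses import dataclass


@dataclass
class ScoredCandidate:
    text: str
    absolute: float  # assumed output of a reference-based MT metric, in [0, 1]
    residual: float  # assumed RESUME-style residual: > 0 means better than the reference


def combined_score(candidate: ScoredCandidate) -> float:
    """Add the residual score to the absolute score, as the abstract describes."""
    return candidate.absolute + candidate.residual


# Assumption for illustration: the metric gives the reference itself the maximum score.
REFERENCE_SCORE = 1.0

# Made-up example values, not results from the paper.
candidates = [
    ScoredCandidate("candidate judged better than the reference", 0.8, 0.3),
    ScoredCandidate("candidate judged worse than the reference", 0.75, -0.25),
]

for c in candidates:
    total = combined_score(c)
    relation = "above" if total > REFERENCE_SCORE else "not above"
    print(f"{c.text}: {total:.2f} ({relation} the reference score)")
```

With the absolute score alone, neither candidate could score higher than the reference; the signed residual is what makes that possible in this sketch.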
Appears in Collections
COLLEGE OF COMPUTING > DEPARTMENT OF ARTIFICIAL INTELLIGENCE > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Jung, Woohwan
ERICA College of Computing (DEPARTMENT OF ARTIFICIAL INTELLIGENCE)