Proper Error Estimation and Calibration for Attention-Based Encoder-Decoder Models

Lee, Mun-Hak; Chang, Joon-Hyuk

doi:10.1109/TASLP.2024.3492799

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lee, Mun-Hak	-
dc.contributor.author	Chang, Joon-Hyuk	-
dc.date.accessioned	2024-12-06T05:30:18Z	-
dc.date.available	2024-12-06T05:30:18Z	-
dc.date.issued	2024-11	-
dc.identifier.issn	2329-9290	-
dc.identifier.issn	2329-9304	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/202075	-
dc.description.abstract	An attention-based automatic speech recognition (ASR) model generates a probability distribution over the token set at each time step. Recent studies have shown that calibration errors exist in the output probability distributions of attention-based ASR models trained to minimize the negative log likelihood. This study analyzes the causes of calibration errors in ASR model outputs and their impact on model performance. Based on the analysis, we argue that conventional methods for estimating calibration errors at the token level are unsuitable for ASR tasks. Accordingly, we propose a new calibration measure that estimates the calibration error at the sequence level. Moreover, we present a new post-hoc calibration function and training objective to mitigate the calibration error of the ASR model at the sequence level. Through experiments using the ASR benchmark, we show that the proposed methods effectively alleviate the calibration error of the ASR model and improve the generalization performance.	-
dc.format.extent	12	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	IEEE Advancing Technology for Humanity	-
dc.title	Proper Error Estimation and Calibration for Attention-Based Encoder-Decoder Models	-
dc.type	Article	-
dc.publisher.location	United States	-
dc.identifier.doi	10.1109/TASLP.2024.3492799	-
dc.identifier.scopusid	2-s2.0-85209104995	-
dc.identifier.wosid	001361960400006	-
dc.identifier.bibliographicCitation	IEEE/ACM Transactions on Audio, Speech, and Language Processing, v.32, pp 4919 - 4930	-
dc.citation.title	IEEE/ACM Transactions on Audio, Speech, and Language Processing	-
dc.citation.volume	32	-
dc.citation.startPage	4919	-
dc.citation.endPage	4930	-
dc.type.docType	Article	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Acoustics	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalWebOfScienceCategory	Acoustics	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.subject.keywordPlus	Encoding (symbols)	-
dc.subject.keywordPlus	Signal encoding	-
dc.subject.keywordPlus	Speech recognition	-
dc.subject.keywordAuthor	Calibration	-
dc.subject.keywordAuthor	Probability distribution	-
dc.subject.keywordAuthor	Decoding	-
dc.subject.keywordAuthor	Accuracy	-
dc.subject.keywordAuthor	Training	-
dc.subject.keywordAuthor	Analytical models	-
dc.subject.keywordAuthor	Measurement uncertainty	-
dc.subject.keywordAuthor	Error analysis	-
dc.subject.keywordAuthor	Data models	-
dc.subject.keywordAuthor	Speech processing	-
dc.subject.keywordAuthor	Speech recognition	-
dc.subject.keywordAuthor	calibration	-
dc.subject.keywordAuthor	post-hoc calibration methods	-
dc.subject.keywordAuthor	attention-based encoder-decoder	-
dc.subject.keywordAuthor	sequence-level training	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10745647	-

Appears in Collections: Seoul College of Engineering > Seoul School of Electronic Engineering > 1. Journal Articles

Related Researcher

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
