General-purpose Adversarial Training for Enhanced Automatic Speech Recognition Model Generalization

Kim, Dohee; Shim, Daeyeol; Chang, Joon-Hyuk

doi:10.21437/Interspeech.2023-2389

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

General-purpose Adversarial Training for Enhanced Automatic Speech Recognition Model Generalization

Authors: Kim, Dohee; Shim, Daeyeol; Chang, Joon-Hyuk

Issue Date: Aug-2023

Keywords: adversarial training; data augmentation; speech recognition

Citation: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v.2023-August, pp 889 - 893

Pages: 5

Indexed: SCOPUS

Journal Title: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume: 2023-August

Start Page: 889

End Page: 893

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/191796

DOI: 10.21437/Interspeech.2023-2389

ISSN: 1990-9772

Abstract: We present a new adversarial training method called General-purpose adversarial training (GPAT) that enhances the performance of automatic speech recognition models. In GPAT, we propose the followings: (1) a plausible adversarial examples converter (PAC); (2) a distribution matching regularization term (DM reg.). Compared to previous studies that directly compute gradients with respect to the input, PAC incorporates non-linearity to achieve performance improvement while eliminating the need for extra forward passes. Furthermore, unlike previous studies that use fixed norms, GPAT can generate similar yet diverse samples through DM reg. We demonstrate that the GPAT elevates the performance of various models on the LibriSpeech dataset. Specifically, by applying GPAT to the conformer model, we achieved 5.3% average relative improvements. With respect to the wav2vec 2.0 experiments, our method yielded a 2.0%/4.4% word error rate on the LibriSpeech test sets without a language model.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE