Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

W2V2-Light: A Lightweight Version of Wav2vec 2.0 for Automatic Speech Recognition

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Dong-Hyun-
dc.contributor.authorLee, Jae-Hong-
dc.contributor.authorMo, Ji-Hwan-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2022-12-20T06:24:54Z-
dc.date.available2022-12-20T06:24:54Z-
dc.date.created2022-11-02-
dc.date.issued2022-09-
dc.identifier.issn2308-457X-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/173086-
dc.description.abstractWav2vec 2.0 (W2V2) has shown remarkable speech recognition performance by pre-training only with unlabeled data and fine-tuning with a small amount of labeled data. However, the practical application of W2V2 is hindered by hardware memory limitations, as it contains 317 million parameters. To address this issue, we propose W2V2-Light, a lightweight version of W2V2. We introduce two simple sharing methods to reduce the memory consumption as well as the computational costs of W2V2. Compared to W2V2, our model has 91% lesser parameters and a speedup of 1.31 times with minor degradation in downstream task performance. Moreover, by quantifying the stability of representations, we provide an empirical insight into why our model is capable of maintaining competitive performance despite the significant reduction in memory.-
dc.language영어-
dc.language.isoen-
dc.publisherInternational Speech Communication Association-
dc.titleW2V2-Light: A Lightweight Version of Wav2vec 2.0 for Automatic Speech Recognition-
dc.typeArticle-
dc.contributor.affiliatedAuthorChang, Joon-Hyuk-
dc.identifier.doi10.21437/Interspeech.2022-10339-
dc.identifier.scopusid2-s2.0-85140087147-
dc.identifier.wosid000900724503042-
dc.identifier.bibliographicCitationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v.2022-September, pp.3038 - 3042-
dc.relation.isPartOfProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.citation.titleProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.citation.volume2022-September-
dc.citation.startPage3038-
dc.citation.endPage3042-
dc.type.rimsART-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaAcoustics-
dc.relation.journalResearchAreaAudiology & Speech-Language Pathology-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryAcoustics-
dc.relation.journalWebOfScienceCategoryAudiology & Speech-Language Pathology-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordPlusSpeech recognition-
dc.subject.keywordPlusSupervised learning-
dc.subject.keywordPlusSpeech communication-
dc.subject.keywordPlusAttention alignment-
dc.subject.keywordPlusAutomatic speech recognition-
dc.subject.keywordPlusFine tuning-
dc.subject.keywordPlusLabeled data-
dc.subject.keywordPlusParameter sharing-
dc.subject.keywordPlusPre-training-
dc.subject.keywordPlusRepresentation learning-
dc.subject.keywordPlusSemi-supervised learning-
dc.subject.keywordPlusSpeech recognition performance-
dc.subject.keywordPlusUnlabeled data-
dc.subject.keywordAuthorattention alignment-
dc.subject.keywordAuthorAutomatic speech recognition-
dc.subject.keywordAuthorparameter sharing-
dc.subject.keywordAuthorrepresentation learning-
dc.subject.keywordAuthorsemi-supervised learning-
dc.identifier.urlhttps://www.isca-speech.org/archive/interspeech_2022/kim22l_interspeech.html-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE