Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

One-Shot Speaker Adaptation Based on Initialization by Generative Adversarial Networks for TTS

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Jaeuk-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2022-12-20T06:25:00Z-
dc.date.available2022-12-20T06:25:00Z-
dc.date.created2022-11-02-
dc.date.issued2022-09-
dc.identifier.issn2308-457X-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/173087-
dc.description.abstractSpeaker adaptation for personalizing text-to-speech (TTS) has become increasingly important. Herein, we propose a novel adaptation using a few seconds of data obtained from an unseen speaker. We first use a speaker embedding lookup table to train a multi-speaker TTS model, wherein each speaker embedding in the lookup table contains information representing a speaker's timbre. We propose an initial embedding predictor that extracts initial embedding suitable for the adaptation of unseen speakers. We use trained speaker embeddings to train the initial embedding predictor. Further, adversarial training is applied to improve the performance. After adversarial training, the initial embedding predictor infers the unseen speaker's initial embedding, and it is fine-tuned. As the initial embedding contains timbre information of the unseen speaker, adaptation is achieved faster and with less data than with conventional methods. We validate the performance with a mean opinion score (MOS) and demonstrate that adaptation is feasible with only 5 s of data.-
dc.language영어-
dc.language.isoen-
dc.publisherInternational Speech Communication Association-
dc.titleOne-Shot Speaker Adaptation Based on Initialization by Generative Adversarial Networks for TTS-
dc.typeArticle-
dc.contributor.affiliatedAuthorChang, Joon-Hyuk-
dc.identifier.doi10.21437/Interspeech.2022-934-
dc.identifier.scopusid2-s2.0-85140087242-
dc.identifier.wosid000900724503030-
dc.identifier.bibliographicCitationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v.2022-September, pp.2978 - 2982-
dc.relation.isPartOfProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.citation.titleProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.citation.volume2022-September-
dc.citation.startPage2978-
dc.citation.endPage2982-
dc.type.rimsART-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaAcoustics-
dc.relation.journalResearchAreaAudiology & Speech-Language Pathology-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryAcoustics-
dc.relation.journalWebOfScienceCategoryAudiology & Speech-Language Pathology-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordPlusGenerative adversarial networks-
dc.subject.keywordPlusSpeech communication-
dc.subject.keywordPlusTable lookup-
dc.subject.keywordPlusEmbeddings-
dc.subject.keywordPlusConventional methods-
dc.subject.keywordPlusEmbeddings-
dc.subject.keywordPlusMean opinion scores-
dc.subject.keywordPlusMulti-speaker-
dc.subject.keywordPlusPerformance-
dc.subject.keywordPlusSpeaker adaptation-
dc.subject.keywordPlusSpeech models-
dc.subject.keywordPlusText to speech-
dc.subject.keywordPlusVoice cloning-
dc.subject.keywordAuthormulti-speaker-
dc.subject.keywordAuthorspeaker adaptation-
dc.subject.keywordAuthorvoice cloning-
dc.identifier.urlhttps://www.isca-speech.org/archive/interspeech_2022/lee22h_interspeech.html-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE