Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Adversarial and Sequential Training for Cross-lingual Prosody Transfer TTS

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Min-Kyung-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2022-12-20T06:25:03Z-
dc.date.available2022-12-20T06:25:03Z-
dc.date.created2022-11-02-
dc.date.issued2022-09-
dc.identifier.issn2308-457X-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/173088-
dc.description.abstractThis study presents a method for improving the performance of the text-to-speech (TTS) model by using three global speech-style representations: language, speaker, and prosody. Synthesizing different languages and prosody in the speaker's voice regardless of their own language and prosody is possible. To construct the embedding of each representation conditioned in the TTS model such that it is independent of the other representations, we propose an adversarial training method for the general architecture of TTS models. Furthermore, we introduce a sequential training method that includes rehearsal-based continual learning to train complex and small amounts of data without forgetting previously learned information. The experimental results show that the proposed method can generate good-quality speech and yield high similarity for speakers and prosody, even for representations that the speaker in the dataset does not contain.-
dc.language영어-
dc.language.isoen-
dc.publisherInternational Speech Communication Association-
dc.titleAdversarial and Sequential Training for Cross-lingual Prosody Transfer TTS-
dc.typeArticle-
dc.contributor.affiliatedAuthorChang, Joon-Hyuk-
dc.identifier.doi10.21437/Interspeech.2022-865-
dc.identifier.scopusid2-s2.0-85140092669-
dc.identifier.wosid000900724504148-
dc.identifier.bibliographicCitationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, v.2022-September, pp.4556 - 4560-
dc.relation.isPartOfProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.citation.titleProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.citation.volume2022-September-
dc.citation.startPage4556-
dc.citation.endPage4560-
dc.type.rimsART-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaAcoustics-
dc.relation.journalResearchAreaAudiology & Speech-Language Pathology-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryAcoustics-
dc.relation.journalWebOfScienceCategoryAudiology & Speech-Language Pathology-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordPlusAdversarial training-
dc.subject.keywordPlusContinual learning-
dc.subject.keywordPlusCross-lingual-
dc.subject.keywordPlusPerformance-
dc.subject.keywordPlusProsody-
dc.subject.keywordPlusRepresentation languages-
dc.subject.keywordPlusSpeech models-
dc.subject.keywordPlusSpeech style-
dc.subject.keywordPlusText to speech-
dc.subject.keywordPlusTraining methods-
dc.subject.keywordPlusSpeech communication-
dc.subject.keywordAuthoradversarial training-
dc.subject.keywordAuthorcontinual learning-
dc.subject.keywordAuthorcross-lingual-
dc.subject.keywordAuthorprosody-
dc.subject.keywordAuthortext-to-speech-
dc.identifier.urlhttps://www.isca-speech.org/archive/interspeech_2022/kim22g_interspeech.html-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE