Detailed Information

Cited 0 time in webofscience Cited 1 time in scopus
Metadata Downloads

DNN based multi-speaker speech synthesis with temporal auxiliary speaker ID embedding

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Junmo-
dc.contributor.authorSong, Kwangsub-
dc.contributor.authorNoh, Kyoungjin-
dc.contributor.authorPark, Tae-Jun-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2021-07-30T05:23:06Z-
dc.date.available2021-07-30T05:23:06Z-
dc.date.created2021-05-13-
dc.date.issued2019-05-
dc.identifier.issn0000-0000-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/4572-
dc.description.abstractIn this paper, multi speaker speech synthesis using speaker embedding is proposed. The proposed model is based on Tacotron network, but post-processing network of the model is modified with dilated convolution layers, which used in Wavenet architecture, to make it more adaptive to speech. The model can generate multi speaker voice with only one neural network model by giving auxiliary input data, speaker embedding, to the network. This model shows successful result for generating two speaker's voices without significant deterioration of speech quality.-
dc.language영어-
dc.language.isoen-
dc.publisherInstitute of Electrical and Electronics Engineers Inc.-
dc.titleDNN based multi-speaker speech synthesis with temporal auxiliary speaker ID embedding-
dc.typeArticle-
dc.contributor.affiliatedAuthorChang, Joon-Hyuk-
dc.identifier.doi10.23919/ELINFOCOM.2019.8706390-
dc.identifier.scopusid2-s2.0-85065886857-
dc.identifier.wosid000470015800014-
dc.identifier.bibliographicCitationICEIC 2019 - International Conference on Electronics, Information, and Communication, pp.1 - 4-
dc.relation.isPartOfICEIC 2019 - International Conference on Electronics, Information, and Communication-
dc.citation.titleICEIC 2019 - International Conference on Electronics, Information, and Communication-
dc.citation.startPage1-
dc.citation.endPage4-
dc.type.rimsART-
dc.type.docTypeConference Paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordPlusDeep learning-
dc.subject.keywordPlusDeep neural networks-
dc.subject.keywordPlusDeterioration-
dc.subject.keywordPlusEmbeddings-
dc.subject.keywordPlusAuxiliary inputs-
dc.subject.keywordPlusNeural network model-
dc.subject.keywordPlusPost processing-
dc.subject.keywordPlusSequence to sequence-
dc.subject.keywordPlusSignificant deteriorations-
dc.subject.keywordPlusSpeaker id-
dc.subject.keywordPlusSpeech quality-
dc.subject.keywordPlusSpeech synthesis-
dc.subject.keywordAuthorDeep learning-
dc.subject.keywordAuthorMulti speaker speech synthesis-
dc.subject.keywordAuthorSequence to sequence-
dc.subject.keywordAuthorSpeech synthesis-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/8706390-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE