Investigation of DNN based feature enhancement jointly trained with x-vectors for noise-robust speaker verification

Yang, Joon-Young; Park, Kwan-Ho; Chang, Joon-Hyuk; Kim, Youngsam; Cho, Sangrae

doi:10.1109/ICEIC49074.2020.9051093

Detailed Information

Cited 0 time in webofscience

Cited 1 time in scopus

Metadata Downloads

Investigation of DNN based feature enhancement jointly trained with x-vectors for noise-robust speaker verification

Authors: Yang, Joon-Young; Park, Kwan-Ho; Chang, Joon-Hyuk; Kim, Youngsam; Cho, Sangrae

Issue Date: Jan-2020

Publisher: Institute of Electrical and Electronics Engineers Inc.

Keywords: Deep speaker embedding; Feature enhancement; Joint training; Speaker verification

Citation: 2020 International Conference on Electronics, Information, and Communication, ICEIC 2020, pp.1 - 5

Indexed: SCOPUS

Journal Title: 2020 International Conference on Electronics, Information, and Communication, ICEIC 2020

Start Page: 1

End Page: 5

URI: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/4465

DOI: 10.1109/ICEIC49074.2020.9051093

Abstract: In this paper, we investigate the deep neural network (DNN) based feature enhancement as the denoising frontend of the x-vector speaker verification framework in noisy environments. Firstly, the feature enhancement DNN (FE-DNN) learns the mapping function from the noisy to the clean corpora on the frame-level acoustic feature domain, and then the x-vector network (XvectorNet) is trained on top of the enhanced features. Finally, the separately trained FE-DNN and the XvectorNet are serially concatenated and jointly trained under the supervision of cross-entropy loss. In addition., we adopt the logistic margin softmax layer for training the XvectorNet in order to obtain more discriminative speaker embeddings.

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE