Detailed Information

Cited 1 time in webofscience Cited 2 time in scopus
Metadata Downloads

Joint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments

Full metadata record
DC Field Value Language
dc.contributor.authorNoh, Kyoungjin-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2021-08-02T09:28:50Z-
dc.date.available2021-08-02T09:28:50Z-
dc.date.created2021-05-12-
dc.date.issued2020-04-
dc.identifier.issn1424-8220-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/9920-
dc.description.abstractIn this paper, we propose joint optimization of deep neural network (DNN)-supported dereverberation and beamforming for the convolutional recurrent neural network (CRNN)-based sound event detection (SED) in multi-channel environments. First, the short-time Fourier transform (STFT) coefficients are calculated from multi-channel audio signals under the noisy and reverberant environments, which are then enhanced by the DNN-supported weighted prediction error (WPE) dereverberation with the estimated masks. Next, the STFT coefficients of the dereverberated multi-channel audio signals are conveyed to the DNN-supported minimum variance distortionless response (MVDR) beamformer in which DNN-supported MVDR beamforming is carried out with the source and noise masks estimated by the DNN. As a result, the single-channel enhanced STFT coefficients are shown at the output and tossed to the CRNN-based SED system, and then, the three modules are jointly trained by the single loss function designed for SED. Furthermore, to ease the difficulty of training a deep learning model for SED caused by the imbalance in the amount of data for each class, the focal loss is used as a loss function. Experimental results show that joint training of DNN-supported dereverberation and beamforming with the SED model under the supervision of focal loss significantly improves the performance under the noisy and reverberant environments.-
dc.language영어-
dc.language.isoen-
dc.publisherMDPI-
dc.titleJoint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments-
dc.typeArticle-
dc.contributor.affiliatedAuthorChang, Joon-Hyuk-
dc.identifier.doi10.3390/s20071883-
dc.identifier.scopusid2-s2.0-85082792638-
dc.identifier.wosid000537110500079-
dc.identifier.bibliographicCitationSENSORS, v.20, no.7, pp.1 - 13-
dc.relation.isPartOfSENSORS-
dc.citation.titleSENSORS-
dc.citation.volume20-
dc.citation.number7-
dc.citation.startPage1-
dc.citation.endPage13-
dc.type.rimsART-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaChemistry-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaInstruments & Instrumentation-
dc.relation.journalWebOfScienceCategoryChemistry, Analytical-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryInstruments & Instrumentation-
dc.subject.keywordPlusRECOGNITION-
dc.subject.keywordAuthorsound event detection-
dc.subject.keywordAuthordereverberation-
dc.subject.keywordAuthoracoustic beamforming-
dc.subject.keywordAuthorconvolutional recurrent neural network-
dc.subject.keywordAuthorjoint optimization-
dc.identifier.urlhttps://www.mdpi.com/1424-8220/20/7/1883-
Files in This Item
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE