Joint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments

Noh, Kyoungjin; Chang, Joon-Hyuk

doi:10.3390/s20071883

Detailed Information

Cited 1 time in webofscience

Cited 2 time in scopus

Metadata Downloads

Joint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments

Full metadata record

DC Field	Value	Language
dc.contributor.author	Noh, Kyoungjin	-
dc.contributor.author	Chang, Joon-Hyuk	-
dc.date.accessioned	2021-08-02T09:28:50Z	-
dc.date.available	2021-08-02T09:28:50Z	-
dc.date.created	2021-05-12	-
dc.date.issued	2020-04	-
dc.identifier.issn	1424-8220	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/9920	-
dc.description.abstract	In this paper, we propose joint optimization of deep neural network (DNN)-supported dereverberation and beamforming for the convolutional recurrent neural network (CRNN)-based sound event detection (SED) in multi-channel environments. First, the short-time Fourier transform (STFT) coefficients are calculated from multi-channel audio signals under the noisy and reverberant environments, which are then enhanced by the DNN-supported weighted prediction error (WPE) dereverberation with the estimated masks. Next, the STFT coefficients of the dereverberated multi-channel audio signals are conveyed to the DNN-supported minimum variance distortionless response (MVDR) beamformer in which DNN-supported MVDR beamforming is carried out with the source and noise masks estimated by the DNN. As a result, the single-channel enhanced STFT coefficients are shown at the output and tossed to the CRNN-based SED system, and then, the three modules are jointly trained by the single loss function designed for SED. Furthermore, to ease the difficulty of training a deep learning model for SED caused by the imbalance in the amount of data for each class, the focal loss is used as a loss function. Experimental results show that joint training of DNN-supported dereverberation and beamforming with the SED model under the supervision of focal loss significantly improves the performance under the noisy and reverberant environments.	-
dc.language	영어	-
dc.language.iso	en	-
dc.publisher	MDPI	-
dc.title	Joint Optimization of Deep Neural Network-Based Dereverberation and Beamforming for Sound Event Detection in Multi-Channel Environments	-
dc.type	Article	-
dc.contributor.affiliatedAuthor	Chang, Joon-Hyuk	-
dc.identifier.doi	10.3390/s20071883	-
dc.identifier.scopusid	2-s2.0-85082792638	-
dc.identifier.wosid	000537110500079	-
dc.identifier.bibliographicCitation	SENSORS, v.20, no.7, pp.1 - 13	-
dc.relation.isPartOf	SENSORS	-
dc.citation.title	SENSORS	-
dc.citation.volume	20	-
dc.citation.number	7	-
dc.citation.startPage	1	-
dc.citation.endPage	13	-
dc.type.rims	ART	-
dc.type.docType	Article	-
dc.description.journalClass	1	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Chemistry	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Instruments & Instrumentation	-
dc.relation.journalWebOfScienceCategory	Chemistry, Analytical	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Instruments & Instrumentation	-
dc.subject.keywordPlus	RECOGNITION	-
dc.subject.keywordAuthor	sound event detection	-
dc.subject.keywordAuthor	dereverberation	-
dc.subject.keywordAuthor	acoustic beamforming	-
dc.subject.keywordAuthor	convolutional recurrent neural network	-
dc.subject.keywordAuthor	joint optimization	-
dc.identifier.url	https://www.mdpi.com/1424-8220/20/7/1883	-

Files in This Item

sensors-20-01883.pdf 411.47 kB

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :5,996,966; Today View :24,270

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1365

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE