Selective Film Conditioning with CTC-Based ASR Probability for Speech Enhancement

양다희; Chang, Joon-Hyuk

doi:10.1109/ICASSP49357.2023.10096375

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Selective Film Conditioning with CTC-Based ASR Probability for Speech Enhancement

Full metadata record

DC Field	Value	Language
dc.contributor.author	양다희	-
dc.contributor.author	Chang, Joon-Hyuk	-
dc.date.accessioned	2024-11-28T10:31:10Z	-
dc.date.available	2024-11-28T10:31:10Z	-
dc.date.issued	2023-06	-
dc.identifier.issn	0736-7791	-
dc.identifier.issn	1520-6149	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/196127	-
dc.description.abstract	Enhancing speech quality and intelligibility for automatic speech recognition (ASR) plays an important role in modeling speech enhancement (SE) systems. However, improving the ASR performance by utilizing SE networks is not guaranteed, owing to the discrepancy in the training methods of the two systems. Therefore, recent studies have gradually incorporated ASR information into SE systems by jointly training ASR and SE systems. Although prior studies have improved the performance, they are inefficient because the two networks are combined and require large model sizes. To address this limitation, we propose an efficient way to use feature-wise linear modulation (FiLM) conditioning with CTC-based ASR probabilities for the SE system. The proposed model is designed by stacking a FiLM layer with selective learning on each temporal convolutional network of the SE estimation module. This allows the SE network to adaptively select ASR information based on the relationship between context and acoustic information. The proposed method improves SE and ASR performance, resulting in more robust results against noise with only a small increase in the number of parameters.	-
dc.format.extent	5	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Selective Film Conditioning with CTC-Based ASR Probability for Speech Enhancement	-
dc.type	Article	-
dc.publisher.location	미국	-
dc.identifier.doi	10.1109/ICASSP49357.2023.10096375	-
dc.identifier.scopusid	2-s2.0-85180535669	-
dc.identifier.bibliographicCitation	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1 - 5	-
dc.citation.title	ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings	-
dc.citation.startPage	1	-
dc.citation.endPage	5	-
dc.type.docType	Conference paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.subject.keywordPlus	Automatic speech recognition	-
dc.subject.keywordPlus	Feature-wise linear modulation conditioning	-
dc.subject.keywordPlus	Frame-wise CTC-based posterior probability	-
dc.subject.keywordPlus	Linear modulations	-
dc.subject.keywordPlus	Posterior probability	-
dc.subject.keywordPlus	Speech enhancement system	-
dc.subject.keywordPlus	Speech quality	-
dc.subject.keywordPlus	Speech recognition performance	-
dc.subject.keywordPlus	Speech recognition probability	-
dc.subject.keywordPlus	Training methods	-
dc.subject.keywordAuthor	FiLM conditioning	-
dc.subject.keywordAuthor	frame-wise CTC-based posterior probability	-
dc.subject.keywordAuthor	speech enhancement	-
dc.identifier.url	https://ieeexplore.ieee.org/document/10096375	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk: COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE