Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

NAS-TasNet: Neural Architecture Search for Time-Domain Speech Separation

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Joo-Hyun-
dc.contributor.authorChang, Joon-Hyuk-
dc.contributor.authorYang, Jae-Mo-
dc.contributor.authorMoon, Han-Gil-
dc.date.accessioned2024-12-20T06:36:55Z-
dc.date.available2024-12-20T06:36:55Z-
dc.date.issued2022-05-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/203085-
dc.description.abstractThe fully convolutional time-domain speech separation network (Conv-TasNet) has been used as a backbone model in various studies because of its structural excellence. To maximize the performance and efficiency of Conv-TasNet, we attempt to apply a neural architecture search (NAS). NAS is a branch of automated machine learning that automatically searches for an optimal model structure while minimizing human intervention. In this study, we introduce a candidate operation to define the search space of NAS for Conv-TasNet. In addition, we introduce a low computational cost NAS to overcome the limitations of the backbone model that consumes large GPU memory for training. Next, we determine the optimized separation module structures using two search strategies based on gradient descent and reinforcement learning. In addition, when NAS is simply applied, there is an imbalance in the updating of architecture parameters, which are NAS parameters. Therefore, we introduce an auxiliary loss method that is appropriate for the Conv-TasNet architecture for a balanced architecture parameter update of the entire model. Furthermore, we determine that the auxiliary loss technique mitigates the imbalance of architecture parameter updates and improves the separation accuracy.-
dc.format.extent13-
dc.language영어-
dc.language.isoENG-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.titleNAS-TasNet: Neural Architecture Search for Time-Domain Speech Separation-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/ACCESS.2022.3176003-
dc.identifier.wosid000804631500001-
dc.identifier.bibliographicCitationIEEE ACCESS, v.10, pp 56031 - 56043-
dc.citation.titleIEEE ACCESS-
dc.citation.volume10-
dc.citation.startPage56031-
dc.citation.endPage56043-
dc.type.docTypeArticle-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.subject.keywordPlusENHANCEMENT-
dc.subject.keywordAuthorComputational modeling-
dc.subject.keywordAuthorComputer architecture-
dc.subject.keywordAuthorConvolution-
dc.subject.keywordAuthorDeep learning-
dc.subject.keywordAuthorTraining-
dc.subject.keywordAuthorTime-domain analysis-
dc.subject.keywordAuthorTask analysis-
dc.subject.keywordAuthorAutomated machine learning (AutoML)-
dc.subject.keywordAuthorconvolutional neural network (CNN)-
dc.subject.keywordAuthordeep learning-
dc.subject.keywordAuthorend-to-end-
dc.subject.keywordAuthorspeech processing-
dc.subject.keywordAuthorspeech separation-
dc.subject.keywordAuthorneural architecture search-
dc.subject.keywordAuthortime-domain speech separation-
dc.identifier.urlhttps://ieeexplore.ieee.org/document/9777717/-
Files in This Item
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE