Detailed Information

Cited 20 time in webofscience Cited 26 time in scopus
Metadata Downloads

Packet Loss Concealment Based on Deep Neural Networks for Digital Speech Transmission

Full metadata record
DC Field Value Language
dc.contributor.authorLee, Bong-Ki-
dc.contributor.authorChang, Joon-Hyuk-
dc.date.accessioned2021-08-02T17:31:15Z-
dc.date.available2021-08-02T17:31:15Z-
dc.date.issued2016-02-
dc.identifier.issn2329-9290-
dc.identifier.issn2329-9304-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/24002-
dc.description.abstractIn this paper, we propose the regression-based packet loss concealment (PLC) for digital speech transmission by using deep neural networks (DNNs) with a multiple-layer deep architecture. For the DNN training, log-power spectra and phases are employed as features in the input layer for the large training set, which ensures non-linear mapping the frames from the last correctly received frame to the missing frame. Once the training is accomplished by the restricted Boltzmann machine (RBM)-based pre-training to initialize the DNN, minimum mean square error (MMSE)-based fine tuning is then performed based on the back-propagation algorithm. In the reconstruction stage, the trained DNN model is fed with the features of the previous frames in order to estimate the log-power spectra and phases of the missing frames. Reconstruction is further improved by using the cross-fading technique to mitigate discontinuity between the reconstruction signal and good frame signal in the time-domain. To demonstrate the performance of the proposed algorithm, hidden Markov model (HMM)-based PLC algorithm and the PLC algorithm standardized in adaptive multi-rate wideband (AMR-WB) Appendix I were used for comparison. The experimental results show that the proposed approach provides better speech quality and speech recognition accuracy than the conventional approaches.-
dc.format.extent10-
dc.language영어-
dc.language.isoENG-
dc.publisherIEEE Advancing Technology for Humanity-
dc.titlePacket Loss Concealment Based on Deep Neural Networks for Digital Speech Transmission-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1109/TASLP.2015.2509780-
dc.identifier.scopusid2-s2.0-84962840611-
dc.identifier.wosid000368453900001-
dc.identifier.bibliographicCitationIEEE/ACM Transactions on Audio, Speech, and Language Processing, v.24, no.2, pp 378 - 387-
dc.citation.titleIEEE/ACM Transactions on Audio, Speech, and Language Processing-
dc.citation.volume24-
dc.citation.number2-
dc.citation.startPage378-
dc.citation.endPage387-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaAcoustics-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryAcoustics-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordPlusLOSS-RECOVERY TECHNIQUES-
dc.subject.keywordPlusCODING STANDARD-
dc.subject.keywordPlusQUALITY-
dc.subject.keywordPlusVOICE-
dc.subject.keywordPlusALGORITHM-
dc.subject.keywordPlusG.722-
dc.subject.keywordAuthorAdaptive multi-rate wideband-
dc.subject.keywordAuthordeep neural network (DNN)-
dc.subject.keywordAuthornetwork speech recognition-
dc.subject.keywordAuthorpacket loss concealment (PLC)-
dc.subject.keywordAuthorregression model-
dc.subject.keywordAuthorspeech quality-
Files in This Item
There are no files associated with this item.
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE