Light-weight Frequency Information Aware Neural Network Architecture for Voice Spoofing Detection
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Choi, S. | - |
dc.contributor.author | Oh, S. | - |
dc.contributor.author | Yang, J. | - |
dc.contributor.author | Lee, Y. | - |
dc.contributor.author | Kwak, I.-Y. | - |
dc.date.accessioned | 2023-03-08T05:31:09Z | - |
dc.date.available | 2023-03-08T05:31:09Z | - |
dc.date.issued | 2022-08 | - |
dc.identifier.issn | 1051-4651 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/61234 | - |
dc.description.abstract | The voice assistant market is overgrowing, and mainstream services like Bixby (Samsung), Alexa (Amazon), and Siri (Apple) are quickly being upgraded to support advanced commands. Such capabilities make them lucrative targets for attackers to exploit. Voice spoofing attacks involve recording voice commands of a target victim and simply replaying them through a loudspeaker. The 2019 Automatic Speaker Verification Spoofing And Countermeasures Challenge(ASVspoof) competition aims to facilitate the design of highly accurate voice spoofing attack detection systems. However, most of the presented models do not take frequency-level modeling into account in their modeling architecture and do not consider model complexity. To design a light-weight system with frequency-level modeling, we propose two systems: 1) Double Depthwise Separable (DDWS) convolution and 2) BC-ResNet with max feature map (MFM) activation (BC-ResMax). We evaluate the accuracy by equal error rate (EER) using the ASVspoof 2019 dataset. Our single models of parallel DDWS, sequential DDWS, and BC-ResMax model achieved spoofing attack detection EER of 2.63%, 2.08% and 2.59% in the LA dataset, and 0.47%, 0.63% and 0.49% in the PA dataset, achieving comparable performance with other top ensemble systems from the competition. Furthermore, parallel DDWS, sequential DDWS, and BC-ResMax used only 45K, 28K and 29K numbers of parameters which are far fewer than existing models. © 2022 IEEE. | - |
dc.format.extent | 7 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | Light-weight Frequency Information Aware Neural Network Architecture for Voice Spoofing Detection | - |
dc.type | Article | - |
dc.identifier.doi | 10.1109/ICPR56361.2022.9956079 | - |
dc.identifier.bibliographicCitation | Proceedings - International Conference on Pattern Recognition, v.2022-August, pp 477 - 483 | - |
dc.description.isOpenAccess | N | - |
dc.identifier.wosid | 000897707600066 | - |
dc.identifier.scopusid | 2-s2.0-85141554264 | - |
dc.citation.endPage | 483 | - |
dc.citation.startPage | 477 | - |
dc.citation.title | Proceedings - International Conference on Pattern Recognition | - |
dc.citation.volume | 2022-August | - |
dc.type.docType | Conference Paper | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Imaging Science & Photographic Technology | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Imaging Science & Photographic Technology | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
84, Heukseok-ro, Dongjak-gu, Seoul, Republic of Korea (06974)02-820-6194
COPYRIGHT 2019 Chung-Ang University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.