Voice Spoofing Detection Through Residual Network, Max Feature Map, and Depthwise Separable Convolution

Kwak, Il-Youp; Kwag, Sungsu; Lee, Junhee; Jeon, Youngbae; Hwang, Jeonghwan; Choi, Hyo-Jung; Yang, Jong-Hoon; Han, So-Yul; Huh, Jun Ho; Lee, Choong-Hoon; Yoon, Ji Won

doi:10.1109/ACCESS.2023.3275790

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kwak, Il-Youp	-
dc.contributor.author	Kwag, Sungsu	-
dc.contributor.author	Lee, Junhee	-
dc.contributor.author	Jeon, Youngbae	-
dc.contributor.author	Hwang, Jeonghwan	-
dc.contributor.author	Choi, Hyo-Jung	-
dc.contributor.author	Yang, Jong-Hoon	-
dc.contributor.author	Han, So-Yul	-
dc.contributor.author	Huh, Jun Ho	-
dc.contributor.author	Lee, Choong-Hoon	-
dc.contributor.author	Yoon, Ji Won	-
dc.date.accessioned	2024-01-09T01:04:28Z	-
dc.date.available	2024-01-09T01:04:28Z	-
dc.date.issued	2023	-
dc.identifier.issn	2169-3536	-
dc.identifier.uri	https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/69709	-
dc.description.abstract	The goal of the '2019 Automatic Speaker Verification Spoofing and Countermeasures Challenge' (ASVspoof) was to make it easier to create systems that could identify voice spoofing attacks with high accuracy. However, the competition did not emphasize model complexity or latency, even though both are stringent requirements for real-world deployment. Most of the top-performing solutions from the competition used an ensemble technique that merged numerous sophisticated deep learning models to maximize detection accuracy. Those approaches struggle with the deployment constraints of voice assistants, which have limited resources. We merged skip connection (from ResNet) and max feature map (from Light CNN) to create a compact system, and we tested its performance using the ASVspoof 2019 dataset. Our single model achieved a replay attack detection equal error rate (EER) of 0.30% on the evaluation set using an optimized constant Q transform (CQT) feature, outperforming the top ensemble system in the competition, which scored an EER of 0.39%. We experimented with depthwise separable convolutions (from MobileNet) to reduce model size; this resulted in an 84.3% reduction in parameter count (from 286K to 45K) while maintaining similar performance (EER of 0.36%). Additionally, we used Grad-CAM to clarify which spectrogram regions contribute significantly to the detection of fake data. © 2013 IEEE.	-
dc.format.extent	13	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	-
dc.title	Voice Spoofing Detection Through Residual Network, Max Feature Map, and Depthwise Separable Convolution	-
dc.type	Article	-
dc.identifier.doi	10.1109/ACCESS.2023.3275790	-
dc.identifier.bibliographicCitation	IEEE Access, v.11, pp 49140 - 49152	-
dc.description.isOpenAccess	Y	-
dc.identifier.wosid	001005689300001	-
dc.identifier.scopusid	2-s2.0-85161259283	-
dc.citation.endPage	49152	-
dc.citation.startPage	49140	-
dc.citation.title	IEEE Access	-
dc.citation.volume	11	-
dc.type.docType	Article	-
dc.publisher.location	United States	-
dc.subject.keywordAuthor	Voice assistant security	-
dc.subject.keywordAuthor	voice presentation attack detection	-
dc.subject.keywordAuthor	voice spoofing attack	-
dc.subject.keywordAuthor	voice synthesis attack	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Telecommunications	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Engineering, Electrical & Electronic	-
dc.relation.journalWebOfScienceCategory	Telecommunications	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
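
The abstract names two building blocks whose mechanics are easy to show in a few lines: the max feature map (MFM) from Light CNN and the depthwise separable convolution from MobileNet. The following is a minimal sketch, not the authors' implementation. The function names and the example layer size (32 input channels, 64 output channels, 3x3 kernel) are assumptions made for illustration; only the 286K and 45K parameter totals come from the abstract.

```python
def conv_params(c_in, c_out, k):
    """Weight count of a standard k x k convolution (bias omitted)."""
    return c_in * c_out * k * k


def depthwise_separable_params(c_in, c_out, k):
    """Weight count of a depthwise k x k convolution (one filter per
    input channel) followed by a 1 x 1 pointwise convolution."""
    return c_in * k * k + c_in * c_out


def max_feature_map(channels):
    """MFM: split the channel list in half and keep the element-wise
    maximum of each pair, halving the number of channels."""
    half = len(channels) // 2
    return [
        [max(a, b) for a, b in zip(first, second)]
        for first, second in zip(channels[:half], channels[half:])
    ]


def relative_reduction(before, after):
    """Fraction of parameters removed when going from before to after."""
    return 1 - after / before


# Hypothetical layer: 32 -> 64 channels with a 3 x 3 kernel.
standard = conv_params(32, 64, 3)
separable = depthwise_separable_params(32, 64, 3)

# Four toy channels of two values each collapse to two channels.
mfm_out = max_feature_map([[1, 5], [3, 2], [4, 0], [2, 9]])

# Totals reported in the abstract: 286K -> 45K parameters.
reported = round(relative_reduction(286_000, 45_000) * 100, 1)
```

For the hypothetical layer, the standard convolution needs 18,432 weights and the separable version 2,336. The reported totals of 286K and 45K work out to the 84.3% reduction stated in the abstract.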

Files in This Item

Voice Spoofing Detection Through Residual Network, Max Feature Map, and Depthwise Separable Convolution.pdf 4.22 MB

Appears in Collections: ETC > 1. Journal Articles

Related Researcher

Kwak, Il-Youp: Graduate School (Department of Statistics and Data Science)
