언어 모델 기반 음성 특징 추출을 활용한 생성 음성 탐지

김승민; 박소희; 최대선

doi:10.13089/JKIISC.2024.34.3.439

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

언어 모델 기반 음성 특징 추출을 활용한 생성 음성 탐지

Full metadata record

DC Field	Value	Language
dc.contributor.author	김승민	-
dc.contributor.author	박소희	-
dc.contributor.author	최대선	-
dc.date.accessioned	2024-07-02T07:00:31Z	-
dc.date.available	2024-07-02T07:00:31Z	-
dc.date.issued	2024-06	-
dc.identifier.issn	1598-3986	-
dc.identifier.issn	2288-2715	-
dc.identifier.uri	https://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/49833	-
dc.description.abstract	최근 음성 생성 기술의 급격한 발전으로, 텍스트만으로도 자연스러운 음성 합성이 가능해졌다. 이러한 발전은 타인의 음성을 생성하여 범죄에 이용하는 보이스피싱과 같은 악용 사례를 증가시키는 결과를 낳고 있다. 음성 생성 여부를 탐지하는 모델은 많이 개발되고 있으며, 일반적으로 음성의 특징을 추출하고 이러한 특징을 기반으로 음성 생성 여부를 탐지한다. 본 논문은 생성 음성으로 인한 악용 사례에 대응하기 위해 새로운 음성 특징 추출 모델을 제안한다. 오디오를 입력으로 받는 딥러닝 기반 오디오 코덱 모델과 사전 학습된 자연어 처리 모델인 BERT를 사용하여 새로운 음성 특징 추출 모델을 제안하였다. 본 논문이 제안한 음성 특징 추출 모델이 음성 탐지에 적합한지 확인하기 위해 추출된 특징을 활용하여 4가지 생성 음성 탐지 모델을 만들어 성능평가를 진행하였다. 성능 비교를 위해 기존 논문에서 제안한 Deepfeature 기반의 음성 탐지 모델 3개와 그 외 모델과 정확도 및 EER을 비교하였다. 제안한 모델은 88.08%로 기존 모델보다 높은 정확도와 11.79%의 낮은 EER을 보였다. 이를 통해 본 논문에서 제안한 음성 특징 추출 방법이 생성 음성과 실제 음성을 판별하는 효과적인 도구로 사용될 수 있음을 확인하였다.	-
dc.format.extent	11	-
dc.language	한국어	-
dc.language.iso	KOR	-
dc.publisher	한국정보보호학회	-
dc.title	언어 모델 기반 음성 특징 추출을 활용한 생성 음성 탐지	-
dc.title.alternative	Voice Synthesis Detection Using Language Model-Based Speech Feature Extraction	-
dc.type	Article	-
dc.identifier.doi	10.13089/JKIISC.2024.34.3.439	-
dc.identifier.bibliographicCitation	정보보호학회논문지, v.34, no.3, pp 439 - 449	-
dc.identifier.kciid	ART003089524	-
dc.citation.endPage	449	-
dc.citation.number	3	-
dc.citation.startPage	439	-
dc.citation.title	정보보호학회논문지	-
dc.citation.volume	34	-
dc.identifier.url	https://www.kci.go.kr/kciportal/ci/sereArticleSearch/ciSereArtiView.kci?sereArticleSearchBean.artiId=ART003089524	-
dc.publisher.location	대한민국	-
dc.description.isOpenAccess	N	-
dc.subject.keywordAuthor	BERT	-
dc.subject.keywordAuthor	Audio codec	-
dc.subject.keywordAuthor	Voice Features Extraction	-
dc.subject.keywordAuthor	Speech Synthesis	-
dc.subject.keywordAuthor	Generated voice detection	-
dc.description.journalRegisteredClass	kci	-

Files in This Item: Go to Link

Appears in Collections: College of Information Technology > School of Software > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Choi, Daeseon photo

Choi, Daeseon: College of Information Technology (School of Software)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :8,404,414; Today View :2,755

RSS_1.0 RSS_2.0 ATOM_1.0

Soongsil University Library 369 Sangdo-Ro, Dongjak-Gu, Seoul, Korea (06978)02-820-0733

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE