Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Trainable Adaptive Score Normalization for Automatic Speaker Verification

Authors
Choi, Jeong-HwanSeong, Ju-SeokJeoung, Ye-RinChang, Joon-Hyuk
Issue Date
Mar-2025
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
Automatic speaker verification; back-end; fine-tuning; score normalization; speaker embedding
Citation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1 - 5
Pages
5
Indexed
SCOPUS
Journal Title
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Start Page
1
End Page
5
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208312
DOI
10.1109/ICASSP49660.2025.10890182
ISSN
0736-7791
1520-6149
Abstract
Adaptive S-norm (AS-norm) calibrates automatic speaker verification (ASV) scores by normalizing them utilize the scores of impostors which are similar to the input speaker. However, AS-norm does not involve any learning process, limiting its ability to provide appropriate regularization strength for various evaluation utterances. To address this limitation, we propose a trainable AS-norm (TAS-norm) that leverages learnable impostor embeddings (LIEs), which are used to compose the cohort. These LIEs are initialized to represent each speaker in a training dataset consisting of impostor speakers. Subsequently, LIEs are fine-tuned by simulating an ASV evaluation. We utilize a margin penalty during top-scoring IEs selection in fine-tuning to prevent non-impostor speakers from being selected. In our experiments with ECAPA-TDNN, the proposed TAS-norm observed 4.11% and 10.62% relative improvement in equal error rate and minimum detection cost function, respectively, on VoxCeleb1-O trial compared with standard AS-norm without using proposed LIEs. We further validated the effectiveness of the TAS-norm on additional ASV datasets comprising Persian and Chinese, demonstrating its robustness across different languages.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE