Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Diffusion-based Target Device Style Transfer for Robust Acoustic Scene Classification

Authors
Choi, Won-GookChang, Joon-Hyuk
Issue Date
Mar-2025
Publisher
Institute of Electrical and Electronics Engineers Inc.
Keywords
Acoustic Signal Processing; Audio Acoustics; Audio Recordings; Audio Signal Processing; Audio Systems; Classification (of Information); Diffusion; Recording Instruments; Audio Signal; Device Characteristics; Diffusion Model; Model-based Opc; Non-linear Distortions; Performance; Recorded Signals; Recording Devices; Scene Classification; Signal Processing Systems; Stochastic Systems
Citation
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1 - 5
Pages
5
Indexed
SCOPUS
Journal Title
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
Start Page
1
End Page
5
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208303
DOI
10.1109/ICASSP49660.2025.10888162
ISSN
0736-7791
1520-6149
Abstract
Audio signal processing systems often operate differently depending on the recording devices, leading to performance discrepancies. Therefore, it is important to know about the characteristics of the recording device; however, it is difficult to know the device's behavior in most cases. In this study, we propose a diffusion-model-based device characteristic transfer to estimate the device's frequency response only with the recorded signals. By joint-training the conditional and unconditional diffusion models, it is found that non-linear distortions and some filtered signals are reflected more than by only training the conditional model. We show that the proposed method transfers the style closely to the ground truth not only visually on the spectrogram but also the t-distributed stochastic neighbor embedding distribution and the performance of the device classifier. We also show the proposed method enhancing the performance as a data augmentation method for acoustic scene classification.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Chang, Joon-Hyuk photo

Chang, Joon-Hyuk
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)
Read more

Altmetrics

Total Views & Downloads

BROWSE