Cited 0 time in
Diffusion-based Target Device Style Transfer for Robust Acoustic Scene Classification
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Choi, Won-Gook | - |
| dc.contributor.author | Chang, Joon-Hyuk | - |
| dc.date.accessioned | 2025-07-22T03:00:12Z | - |
| dc.date.available | 2025-07-22T03:00:12Z | - |
| dc.date.issued | 2025-03 | - |
| dc.identifier.issn | 0736-7791 | - |
| dc.identifier.issn | 1520-6149 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208303 | - |
| dc.description.abstract | Audio signal processing systems often operate differently depending on the recording devices, leading to performance discrepancies. Therefore, it is important to know about the characteristics of the recording device; however, it is difficult to know the device's behavior in most cases. In this study, we propose a diffusion-model-based device characteristic transfer to estimate the device's frequency response only with the recorded signals. By joint-training the conditional and unconditional diffusion models, it is found that non-linear distortions and some filtered signals are reflected more than by only training the conditional model. We show that the proposed method transfers the style closely to the ground truth not only visually on the spectrogram but also the t-distributed stochastic neighbor embedding distribution and the performance of the device classifier. We also show the proposed method enhancing the performance as a data augmentation method for acoustic scene classification. | - |
| dc.format.extent | 5 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
| dc.title | Diffusion-based Target Device Style Transfer for Robust Acoustic Scene Classification | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1109/ICASSP49660.2025.10888162 | - |
| dc.identifier.scopusid | 2-s2.0-105009595162 | - |
| dc.identifier.bibliographicCitation | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp 1 - 5 | - |
| dc.citation.title | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 5 | - |
| dc.type.docType | Conference paper | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.subject.keywordPlus | Acoustic signal processing | - |
| dc.subject.keywordPlus | Audio acoustics | - |
| dc.subject.keywordPlus | Audio recordings | - |
| dc.subject.keywordPlus | Audio signal processing | - |
| dc.subject.keywordPlus | Audio systems | - |
| dc.subject.keywordPlus | Classification (of information) | - |
| dc.subject.keywordPlus | Diffusion | - |
| dc.subject.keywordPlus | Recording instruments | - |
| dc.subject.keywordAuthor | Acoustic Signal Processing | - |
| dc.subject.keywordAuthor | Audio Acoustics | - |
| dc.subject.keywordAuthor | Audio Recordings | - |
| dc.subject.keywordAuthor | Audio Signal Processing | - |
| dc.subject.keywordAuthor | Audio Systems | - |
| dc.subject.keywordAuthor | Classification (of Information) | - |
| dc.subject.keywordAuthor | Diffusion | - |
| dc.subject.keywordAuthor | Recording Instruments | - |
| dc.subject.keywordAuthor | Audio Signal | - |
| dc.subject.keywordAuthor | Device Characteristics | - |
| dc.subject.keywordAuthor | Diffusion Model | - |
| dc.subject.keywordAuthor | Model-based Opc | - |
| dc.subject.keywordAuthor | Non-linear Distortions | - |
| dc.subject.keywordAuthor | Performance | - |
| dc.subject.keywordAuthor | Recorded Signals | - |
| dc.subject.keywordAuthor | Recording Devices | - |
| dc.subject.keywordAuthor | Scene Classification | - |
| dc.subject.keywordAuthor | Signal Processing Systems | - |
| dc.subject.keywordAuthor | Stochastic Systems | - |
| dc.identifier.url | https://ieeexplore.ieee.org/document/10888162 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
