Classification of human sounds using support vector machine with psychoacoustic data
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Ahmed, Shahzad | - |
dc.contributor.author | Jo, Hyun In | - |
dc.contributor.author | Jeon, Jin Yong | - |
dc.date.accessioned | 2022-07-11T15:45:31Z | - |
dc.date.available | 2022-07-11T15:45:31Z | - |
dc.date.created | 2021-05-14 | - |
dc.date.issued | 2018-07 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/149672 | - |
dc.description.abstract | This paper presents the classification of human sounds based on a support vector machine (SVM) using psychoacoustic data. A scream classification model was investigated, in which speech and scream sounds show different acoustical characteristics. Temporal changes were observed by evaluating the physical characteristics of waveforms and spectrograms together with psychoacoustic parameters, including loudness and sharpness. Mel frequency cepstral coefficients were used to identify the spectral energy distribution of screams. Further, a Mel filter bank and a frequency band filter were used to extract the high spectral energy and to differentiate between the lower and higher energy spectra. The classification accuracy was improved by combining the SVM with the psychoacoustic parameters of scream sounds. | - |
dc.language | English | - |
dc.language.iso | en | - |
dc.publisher | The International Institute of Acoustics and Vibration (IIAV) | - |
dc.title | Classification of human sounds using support vector machine with psychoacoustic data | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Jeon, Jin Yong | - |
dc.identifier.scopusid | 2-s2.0-85058797479 | - |
dc.identifier.bibliographicCitation | 25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling, v.8, pp.4595 - 4599 | - |
dc.relation.isPartOf | 25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling | - |
dc.citation.title | 25th International Congress on Sound and Vibration 2018, ICSV 2018: Hiroshima Calling | - |
dc.citation.volume | 8 | - |
dc.citation.startPage | 4595 | - |
dc.citation.endPage | 4599 | - |
dc.type.rims | ART | - |
dc.type.docType | Proceeding | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.subject.keywordPlus | Acoustics | - |
dc.subject.keywordPlus | Audition | - |
dc.subject.keywordPlus | Spectroscopy | - |
dc.subject.keywordPlus | Acoustical characteristics | - |
dc.subject.keywordPlus | Classification accuracy | - |
dc.subject.keywordPlus | Human sounds | - |
dc.subject.keywordPlus | Mel frequency cepstral coefficient | - |
dc.subject.keywordPlus | Physical characteristics | - |
dc.subject.keywordPlus | Psychoacoustic parameters | - |
dc.subject.keywordPlus | Psychoacoustics | - |
dc.subject.keywordPlus | Spectral energy distribution | - |
dc.subject.keywordPlus | Support vector machines | - |
dc.subject.keywordAuthor | Human sound classification | - |
dc.subject.keywordAuthor | Mel frequency cepstral coefficients | - |
dc.subject.keywordAuthor | Psychoacoustics | - |
dc.subject.keywordAuthor | Support vector machine | - |
dc.identifier.url | http://toc.proceedings.com/40638webtoc.pdf | - |
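
The abstract describes using a Mel filter bank and a frequency band filter to separate lower and higher energy spectra. The record gives no implementation details, so the following is only a minimal sketch of that idea, not the authors' method. It assumes the common `2595 * log10(1 + f / 700)` Mel formula; the function names, the number of filters, and the split frequency are all hypothetical choices made for illustration.

```python
import math


def hz_to_mel(f_hz: float) -> float:
    """Convert Hz to mels using the common 2595*log10(1 + f/700) variant (assumed)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(m: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_center_frequencies(n_filters: int, f_min: float, f_max: float) -> list[float]:
    """Center frequencies (Hz) of n_filters bands spaced equally on the Mel scale."""
    m_lo, m_hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (m_hi - m_lo) / (n_filters + 1)
    return [mel_to_hz(m_lo + step * (i + 1)) for i in range(n_filters)]


def high_band_energy_ratio(band_energies, centers_hz, split_hz: float) -> float:
    """Fraction of total band energy in bands centered at or above split_hz.

    split_hz is a hypothetical threshold, not a value taken from the paper.
    """
    total = sum(band_energies)
    if total == 0:
        return 0.0
    high = sum(e for e, c in zip(band_energies, centers_hz) if c >= split_hz)
    return high / total


# Illustrative use with made-up band energies (not data from the paper).
centers = mel_center_frequencies(10, 0.0, 8000.0)
ratio = high_band_energy_ratio([1.0, 1.0, 2.0], [100.0, 1000.0, 4000.0], 2000.0)
```

A ratio like this could serve as one input feature to an SVM alongside loudness and sharpness, under the assumption that screams concentrate more energy in the higher bands than speech does.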