Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Enhancing Speech Emotion Recognition Using Dual Feature Extraction Encoders

Full metadata record
DC Field Value Language
dc.contributor.authorPulatov, Ilkhomjon-
dc.contributor.authorOteniyazov, Rashid-
dc.contributor.authorMakhmudov, Fazliddin-
dc.contributor.authorCho, Young-Im-
dc.date.accessioned2023-08-28T01:42:22Z-
dc.date.available2023-08-28T01:42:22Z-
dc.date.created2023-08-25-
dc.date.issued2023-07-
dc.identifier.issn1424-8220-
dc.identifier.urihttps://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/88904-
dc.description.abstractUnderstanding and identifying emotional cues in human speech is a crucial aspect of human-computer communication. The application of computer technology in dissecting and deciphering emotions, along with the extraction of relevant emotional characteristics from speech, forms a significant part of this process. The objective of this study was to architect an innovative framework for speech emotion recognition predicated on spectrograms and semantic feature transcribers, aiming to bolster performance precision by acknowledging the conspicuous inadequacies in extant methodologies and rectifying them. To procure invaluable attributes for speech detection, this investigation leveraged two divergent strategies. Primarily, a wholly convolutional neural network model was engaged to transcribe speech spectrograms. Subsequently, a cutting-edge Mel-frequency cepstral coefficient feature abstraction approach was adopted and integrated with Speech2Vec for semantic feature encoding. These dual forms of attributes underwent individual processing before they were channeled into a long short-term memory network and a comprehensive connected layer for supplementary representation. By doing so, we aimed to bolster the sophistication and efficacy of our speech emotion detection model, thereby enhancing its potential to accurately recognize and interpret emotion from human speech. The proposed mechanism underwent a rigorous evaluation process employing two distinct databases: RAVDESS and EMO-DB. The outcome displayed a predominant performance when juxtaposed with established models, registering an impressive accuracy of 94.8% on the RAVDESS dataset and a commendable 94.0% on the EMO-DB dataset. This superior performance underscores the efficacy of our innovative system in the realm of speech emotion recognition, as it outperforms current frameworks in accuracy metrics.-
dc.language영어-
dc.language.isoen-
dc.publisherMDPI-
dc.relation.isPartOfSENSORS-
dc.titleEnhancing Speech Emotion Recognition Using Dual Feature Extraction Encoders-
dc.typeArticle-
dc.type.rimsART-
dc.description.journalClass1-
dc.identifier.wosid001039064600001-
dc.identifier.doi10.3390/s23146640-
dc.identifier.bibliographicCitationSENSORS, v.23, no.14-
dc.description.isOpenAccessY-
dc.identifier.scopusid2-s2.0-85166019330-
dc.citation.titleSENSORS-
dc.citation.volume23-
dc.citation.number14-
dc.contributor.affiliatedAuthorPulatov, Ilkhomjon-
dc.contributor.affiliatedAuthorMakhmudov, Fazliddin-
dc.contributor.affiliatedAuthorCho, Young-Im-
dc.type.docTypeArticle-
dc.subject.keywordAuthorspeech emotion recognition-
dc.subject.keywordAuthorCNN-
dc.subject.keywordAuthorLSTM-
dc.subject.keywordAuthorfeature extraction-
dc.subject.keywordAuthorMFCC-
dc.subject.keywordAuthorspectrogram-
dc.subject.keywordPlusDEEP-
dc.subject.keywordPlusCLASSIFICATION-
dc.relation.journalResearchAreaChemistry-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaInstruments & Instrumentation-
dc.relation.journalWebOfScienceCategoryChemistry, Analytical-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryInstruments & Instrumentation-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
IT융합대학 > 컴퓨터공학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Cho, Young Im photo

Cho, Young Im
College of IT Convergence (컴퓨터공학부(컴퓨터공학전공))
Read more

Altmetrics

Total Views & Downloads

BROWSE