A Self-Evaluated Bilingual Automatic Speech Recognition System for Mandarin–English Mixed Conversations

Full metadata record
DC Field: Value
dc.contributor.author: Hai, Xinhe
dc.contributor.author: Aranganadin, Kaviya
dc.contributor.author: Yeh, Cheng Cheng
dc.contributor.author: Hua, Zhengmao
dc.contributor.author: Huang, Chenyun
dc.contributor.author: Hsu, Huayi
dc.contributor.author: Lin, M. C.
dc.date.accessioned: 2025-09-10T02:30:24Z
dc.date.available: 2025-09-10T02:30:24Z
dc.date.issued: 2025-07
dc.identifier.issn: 2076-3417
dc.identifier.uri: https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208701
dc.description.abstract: Bilingual communication is increasingly prevalent in a globally connected world, where cultural exchanges and international interactions are unavoidable. Existing automatic speech recognition (ASR) systems are often limited to a single language. However, demand for bilingual ASR in human–computer interaction is growing, and it has become indispensable in areas such as medical services. This article addresses this need by creating an application programming interface (API)-based platform using VOSK, a popular open-source single-language ASR toolkit, to efficiently deploy a self-evaluated bilingual ASR system that handles both primary and secondary languages in tasks such as Mandarin–English mixed-speech recognition. The mixed error rate (MER) is used as the performance metric, and a workflow is outlined for calculating it with the edit distance algorithm. Results show that the Mandarin–English MER drops from ∼65% to under 13% after the self-evaluation framework and mixed-language algorithms are implemented. These findings highlight the importance of a well-designed system for managing the complexities of mixed-language speech recognition and offer a promising method for building a bilingual ASR system from existing monolingual models. The framework might be further extended to a trilingual or multilingual ASR system by preparing mixed-language datasets and through further software development, without involving complex training.
dc.format.extent: 19
dc.language: English
dc.language.iso: ENG
dc.publisher: MDPI
dc.title: A Self-Evaluated Bilingual Automatic Speech Recognition System for Mandarin–English Mixed Conversations
dc.title.alternative: A Self-Evaluated Bilingual Automatic Speech Recognition System for Mandarin-English Mixed Conversations
dc.type: Article
dc.publisher.location: Switzerland
dc.identifier.doi: 10.3390/app15147691
dc.identifier.scopusid: 2-s2.0-105011753018
dc.identifier.wosid: 001535538800001
dc.identifier.bibliographicCitation: Applied Sciences-Basel, v.15, no.14, pp. 1-19
dc.citation.title: Applied Sciences-Basel
dc.citation.volume: 15
dc.citation.number: 14
dc.citation.startPage: 1
dc.citation.endPage: 19
dc.type.docType: Article
dc.description.isOpenAccess: Y
dc.description.journalRegisteredClass: scie
dc.description.journalRegisteredClass: scopus
dc.relation.journalResearchArea: Chemistry
dc.relation.journalResearchArea: Engineering
dc.relation.journalResearchArea: Materials Science
dc.relation.journalResearchArea: Physics
dc.relation.journalWebOfScienceCategory: Chemistry, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Engineering, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Materials Science, Multidisciplinary
dc.relation.journalWebOfScienceCategory: Physics, Applied
dc.subject.keywordPlus: MODELS
dc.subject.keywordPlus: INFORMATION
dc.subject.keywordAuthor: API
dc.subject.keywordAuthor: Automatic Speech Recognition
dc.subject.keywordAuthor: Bilingual
dc.subject.keywordAuthor: Mandarin–English
dc.subject.keywordAuthor: Mixed Error Rate
dc.subject.keywordAuthor: Computer Systems Programming
dc.subject.keywordAuthor: Human Computer Interaction
dc.subject.keywordAuthor: Linguistics
dc.subject.keywordAuthor: Open Source Software
dc.subject.keywordAuthor: Open Systems
dc.subject.keywordAuthor: Speech Communication
dc.subject.keywordAuthor: Speech Recognition
dc.subject.keywordAuthor: Applications Programming Interfaces
dc.subject.keywordAuthor: Automatic Speech Recognition System
dc.subject.keywordAuthor: Bilinguals
dc.subject.keywordAuthor: Computer Interaction
dc.subject.keywordAuthor: Error Rate
dc.subject.keywordAuthor: Growing Demand
dc.subject.keywordAuthor: Mixed Errors
dc.subject.keywordAuthor: Application Programming Interfaces (API)
dc.identifier.url: https://www.mdpi.com/2076-3417/15/14/7691
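
Illustrative note on the metric named in the abstract: the record states that the mixed error rate (MER) is calculated with the edit distance algorithm but gives no further detail. The Python sketch below shows one common way such a metric can be computed. It is not the authors' implementation. The tokenization rule (one token per Chinese character, one token per English word) and the function names tokenize, edit_distance, and mixed_error_rate are assumptions made for this example only.

```python
import re

# Assumption (not from the record): each CJK character is one token and
# each run of Latin letters, digits, or apostrophes is one token.
_TOKEN = re.compile(r"[\u4e00-\u9fff]|[A-Za-z0-9']+")


def tokenize(text):
    """Split mixed Mandarin-English text into comparable tokens."""
    return [t.lower() for t in _TOKEN.findall(text)]


def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences.

    Counts the minimum number of substitutions, insertions, and
    deletions needed to turn hyp into ref, using two rolling rows.
    """
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def mixed_error_rate(reference, hypothesis):
    """Edit distance divided by the number of reference tokens."""
    ref = tokenize(reference)
    hyp = tokenize(hypothesis)
    if not ref:
        raise ValueError("reference must contain at least one token")
    return edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    # Hypothetical transcript pair, invented for illustration.
    print(mixed_error_rate("我想预约 doctor 明天", "我想预约 doctor 今天"))
```

Under this tokenization the example reference has seven tokens (six characters plus one English word) and the hypothesis differs by a single substituted character, so the score is one error over seven reference tokens. A different tokenization, for instance word-segmented Mandarin, would give different values for the same transcripts.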
Appears in Collections
Seoul College of Engineering > Seoul Major in Electrical Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Lin, Ming Chieh
COLLEGE OF ENGINEERING (MAJOR IN ELECTRICAL ENGINEERING)