Detailed Information

Cited 16 times in Web of Science; cited 18 times in Scopus

Improvement of the end-to-end scene text recognition method for "text-to-speech" conversion

Full metadata record
DC Field | Value | Language
dc.contributor.author | Makhmudov, F. | -
dc.contributor.author | Mukhiddinov, M. | -
dc.contributor.author | Akmalbek, Abdusalomov | -
dc.contributor.author | Avazov, K. | -
dc.contributor.author | Khamdamov, U. | -
dc.contributor.author | Cho, Young Im | -
dc.date.available | 2021-01-06T03:40:51Z | -
dc.date.created | 2020-11-13 | -
dc.date.issued | 2020-11 | -
dc.identifier.issn | 0219-6913 | -
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/79619 | -
dc.description.abstract | Methods for text detection and recognition in images of natural scenes have become an active research topic in computer vision and have obtained encouraging achievements over several benchmarks. In this paper, we introduce a robust yet simple pipeline that produces accurate and fast text detection and recognition for the Uzbek language in natural scene images using a fully convolutional network and the Tesseract OCR engine. First, the text detection step quickly predicts text in random orientations in full-color images with a single fully convolutional neural network, discarding redundant intermediate stages. Then, the text recognition step recognizes the Uzbek language, including both the Latin and Cyrillic alphabets, using a trained Tesseract OCR engine. Finally, the recognized text can be pronounced using the Uzbek language text-to-speech synthesizer. The proposed method was tested on the ICDAR 2013, ICDAR 2015 and MSRA-TD500 datasets, and it showed an advantage in efficiently detecting and recognizing text from natural scene images for assisting the visually impaired. © 2020 | -
dc.language | English | -
dc.language.iso | en | -
dc.publisher | WORLD SCIENTIFIC PUBL CO PTE LTD | -
dc.relation.isPartOf | International Journal of Wavelets, Multiresolution and Information Processing | -
dc.title | Improvement of the end-to-end scene text recognition method for "text-to-speech" conversion | -
dc.type | Article | -
dc.type.rims | ART | -
dc.description.journalClass | 1 | -
dc.identifier.wosid | 000599931700007 | -
dc.identifier.doi | 10.1142/S0219691320500526 | -
dc.identifier.bibliographicCitation | International Journal of Wavelets, Multiresolution and Information Processing, v.18, no.6 | -
dc.description.isOpenAccess | N | -
dc.identifier.scopusid | 2-s2.0-85095450703 | -
dc.citation.title | International Journal of Wavelets, Multiresolution and Information Processing | -
dc.citation.volume | 18 | -
dc.citation.number | 6 | -
dc.contributor.affiliatedAuthor | Makhmudov, F. | -
dc.contributor.affiliatedAuthor | Mukhiddinov, M. | -
dc.contributor.affiliatedAuthor | Akmalbek, Abdusalomov | -
dc.contributor.affiliatedAuthor | Avazov, K. | -
dc.contributor.affiliatedAuthor | Cho, Young Im | -
dc.type.docType | Article | -
dc.subject.keywordAuthor | fully convolutional network | -
dc.subject.keywordAuthor | natural scene images | -
dc.subject.keywordAuthor | optical character recognition | -
dc.subject.keywordAuthor | Scene text detection | -
dc.subject.keywordAuthor | text recognition | -
dc.subject.keywordAuthor | text-to-speech synthesizer | -
dc.subject.keywordAuthor | visually impaired | -
dc.subject.keywordPlus | Character recognition | -
dc.subject.keywordPlus | Convolution | -
dc.subject.keywordPlus | Convolutional neural networks | -
dc.subject.keywordPlus | Engines | -
dc.subject.keywordPlus | Speech synthesis | -
dc.subject.keywordPlus | Convolutional networks | -
dc.subject.keywordPlus | Full color images | -
dc.subject.keywordPlus | Intermediate stage | -
dc.subject.keywordPlus | Natural scene images | -
dc.subject.keywordPlus | Random orientations | -
dc.subject.keywordPlus | Text recognition | -
dc.subject.keywordPlus | Text to speech synthesizers | -
dc.subject.keywordPlus | Visually impaired | -
dc.subject.keywordPlus | Speech recognition | -
dc.relation.journalResearchArea | Computer Science | -
dc.relation.journalResearchArea | Mathematics | -
dc.relation.journalWebOfScienceCategory | Computer Science, Software Engineering | -
dc.relation.journalWebOfScienceCategory | Mathematics, Interdisciplinary Applications | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
Files in This Item
There are no files associated with this item.
Appears in Collections
College of IT Convergence > Department of Computer Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

ugli, Mukhiddinov Mukhriddin Nuriddin
College of IT Convergence (School of Computer Engineering (Computer Engineering major))