Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Deep learning-based recognition system for pashto handwritten text: benchmark on PHTIopen access

Authors
Hussain, IbrarAhmad, RiazUllah, KhalilMuhammad, SirajElhassan, RashaSyed, Ikram
Issue Date
Mar-2024
Publisher
PEERJ INC
Keywords
Deep learning; Natural language processing; Optical character recognition; Pashto handwritten text imagebase
Citation
PEERJ COMPUTER SCIENCE, v.10
Journal Title
PEERJ COMPUTER SCIENCE
Volume
10
URI
https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/91146
DOI
10.7717/peerj-cs.1925
ISSN
2376-5992
2376-5992
Abstract
This article introduces a recognition system for handwritten text in the Pashto language, representing the first attempt to establish a baseline system using the Pashto Handwritten Text Imagebase (PHTI) dataset. Initially, the PHTI dataset underwent pre-processed to eliminate unwanted characters, subsequently, the dataset was divided into training 70%, validation 15%, and test sets 15%. The proposed recognition system is based on multi -dimensional long short-term memory (MD-LSTM) networks. A comprehensive empirical analysis was conducted to determine the optimal parameters for the proposed MD-LSTM architecture; Counter experiments were used to evaluate the performance of the proposed system comparing with the state-of-the-art models on the PHTI dataset. The novelty of our proposed model, compared to other state of the art models, lies in its hidden layer size (i.e., 10, 20, 80) and its Tanh layer size (i.e., 20, 40). The system achieves a Character Error Rate (CER) of 20.77% as a baseline on the test set. The top 20 confusions are reported to check the performance and limitations of the proposed model. The results highlight complications and future perspective of the Pashto language towards the digital transition.
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Syed, Ikram photo

Syed, Ikram
College of IT Convergence (Department of Software)
Read more

Altmetrics

Total Views & Downloads

BROWSE