A Framework for Understanding Unstructured Financial Documents Using RPA and Multimodal Approach
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Cho, Seongkuk | - |
dc.contributor.author | Moon, Jihoon | - |
dc.contributor.author | Bae, Junhyeok | - |
dc.contributor.author | Kang, Jiwon | - |
dc.contributor.author | Lee, Sangwook | - |
dc.date.accessioned | 2023-05-25T01:41:33Z | - |
dc.date.available | 2023-05-25T01:41:33Z | - |
dc.date.issued | 2023-02 | - |
dc.identifier.issn | 2079-9292 | - |
dc.identifier.issn | 2079-9292 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/sch/handle/2021.sw.sch/22449 | - |
dc.description.abstract | The financial business process worldwide suffers from huge dependencies upon labor and written documents, thus making it tedious and time-consuming. In order to solve this problem, traditional robotic process automation (RPA) has recently been developed into a hyper-automation solution by combining computer vision (CV) and natural language processing (NLP) methods. These solutions are capable of image analysis, such as key information extraction and document classification. However, they could improve on text-rich document images and require much training data for processing multilingual documents. This study proposes a multimodal approach-based intelligent document processing framework that combines a pre-trained deep learning model with traditional RPA used in banks to automate business processes from real-world financial document images. The proposed framework can perform classification and key information extraction on a small amount of training data and analyze multilingual documents. In order to evaluate the effectiveness of the proposed framework, extensive experiments were conducted using Korean financial document images. The experimental results show the superiority of the multimodal approach for understanding financial documents and demonstrate that adequate labeling can improve performance by up to about 15%. | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | MDPI AG | - |
dc.title | A Framework for Understanding Unstructured Financial Documents Using RPA and Multimodal Approach | - |
dc.type | Article | - |
dc.publisher.location | 스위스 | - |
dc.identifier.doi | 10.3390/electronics12040939 | - |
dc.identifier.scopusid | 2-s2.0-85149222260 | - |
dc.identifier.wosid | 000939294100001 | - |
dc.identifier.bibliographicCitation | Electronics (Basel), v.12, no.4 | - |
dc.citation.title | Electronics (Basel) | - |
dc.citation.volume | 12 | - |
dc.citation.number | 4 | - |
dc.type.docType | Article | - |
dc.description.isOpenAccess | Y | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalResearchArea | Engineering | - |
dc.relation.journalResearchArea | Physics | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
dc.relation.journalWebOfScienceCategory | Physics, Applied | - |
dc.subject.keywordAuthor | intelligent document processing | - |
dc.subject.keywordAuthor | visual-rich document understanding | - |
dc.subject.keywordAuthor | optical character recognition | - |
dc.subject.keywordAuthor | financial document analysis | - |
dc.subject.keywordAuthor | key information extraction | - |
dc.subject.keywordAuthor | image classification | - |
dc.subject.keywordAuthor | RPA | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
(31538) 22, Soonchunhyang-ro, Asan-si, Chungcheongnam-do, Republic of Korea+82-41-530-1114
COPYRIGHT 2021 by SOONCHUNHYANG UNIVERSITY ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.