오픈소스 OCR을 이용한 군수품 시험성적서 전산화 연구
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 백승현 | - |
dc.date.accessioned | 2025-09-30T08:00:18Z | - |
dc.date.available | 2025-09-30T08:00:18Z | - |
dc.date.issued | 2025-09 | - |
dc.identifier.issn | 1229-1889 | - |
dc.identifier.issn | 2287-9005 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/126563 | - |
dc.description.abstract | Purpose: This study aims to transform test reports in the defense industry into a structured database (DB) by leveraging open-source Optical Character Recognition (OCR) and following the DMADOV methodology for quality improvement. Methods: The research was conducted in two phases following the DMADOV procedure. First, a baseline system was developed using the open-source OCR engine Tesseract to create a text extraction program, with data structuring attempted via rule-based post-processing. Subsequently, to overcome the system's limitations, a multi-model pipeline, specifically PaddleOCR's PP-Structure, was applied to enhance structural recognition performance, including layout analysis and table recognition. The performance of both systems was comparatively verified through quantitative metrics and qualitative analysis. Results: The initial Tesseract-based model heavily relied on strict, rule-based post-processing to ultimately achieve a 100% data match rate, but this revealed the system's lack of scalability and flexibility. In contrast, the optimized system using the multi-model pipeline (PP-Structure) accurately recognized the document's structure and content without requiring separate, complex post-processing, demonstrating superior performance in both qualitative and quantitative aspects. Conclusion: This study clearly identified the limitations of a simple OCR engine and demonstrated that a multi-model pipeline is an effective alternative for the automated structuring of defense quality data. The findings provide a practical roadmap for system integration companies and their partners to build a big data-based quality information system. Furthermore, the study is significant in its proposal of data utilization strategies for the implementation of Defense Quality 4.0. | - |
dc.format.extent | 26 | - |
dc.language | 한국어 | - |
dc.language.iso | KOR | - |
dc.publisher | 한국품질경영학회 | - |
dc.title | 오픈소스 OCR을 이용한 군수품 시험성적서 전산화 연구 | - |
dc.title.alternative | A Study on the Computerization of Military Supplies Test Reports with Open Source OCR | - |
dc.type | Article | - |
dc.publisher.location | 대한민국 | - |
dc.identifier.doi | 10.7469/JKSQM.2025.53.3.435 | - |
dc.identifier.bibliographicCitation | 품질경영학회지, v.53, no.3, pp 435 - 460 | - |
dc.citation.title | 품질경영학회지 | - |
dc.citation.volume | 53 | - |
dc.citation.number | 3 | - |
dc.citation.startPage | 435 | - |
dc.citation.endPage | 460 | - |
dc.type.docType | 정기학술지(Article(Perspective Article포함)) | - |
dc.identifier.kciid | ART003244705 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | kci | - |
dc.subject.keywordAuthor | DMADOV | - |
dc.subject.keywordAuthor | Open Source | - |
dc.subject.keywordAuthor | OCR | - |
dc.subject.keywordAuthor | Military Supplies | - |
dc.subject.keywordAuthor | Test Report | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.