Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Two-stage generative adversarial networks for binarization of color document images

Full metadata record
DC Field Value Language
dc.contributor.authorSuh, Sungho-
dc.contributor.authorKim, Jihun-
dc.contributor.authorLukowicz, Paul-
dc.contributor.authorLee, Yong Oh-
dc.date.accessioned2024-04-16T02:32:19Z-
dc.date.available2024-04-16T02:32:19Z-
dc.date.issued2022-10-01-
dc.identifier.issn0031-3203-
dc.identifier.issn1873-5142-
dc.identifier.urihttps://scholarworks.bwise.kr/hongik/handle/2020.sw.hongik/32965-
dc.description.abstractDocument image enhancement and binarization methods are often used to improve the accuracy and efficiency of document image analysis tasks such as text recognition. Traditional non-machine-learning methods are constructed on low-level features in an unsupervised manner but have difficulty with binarization on documents with severely degraded backgrounds. Convolutional neural network (CNN)based methods focus only on grayscale images and on local textual features. In this paper, we propose a two stage color document image enhancement and binarization method using generative adversarial neural networks. In the first stage, four color-independent adversarial networks are trained to extract color foreground information from an input image for document image enhancement. In the second stage, two independent adversarial networks with global and local features are trained for image binarization of documents of variable size. For the adversarial neural networks, we formulate loss functions between a discriminator and generators having an encoder-decoder structure. Experimental results show that the proposed method achieves better performance than many classical and state-of-the-art algorithms over the Document Image Binarization Contest (DIBCO) datasets, the LRDE Document Binarization Dataset (LRDE DBD), and our shipping label image dataset. We plan to release the shipping label dataset as well as our implementation code at github.com/opensuh/DocumentBinarization/. (c) 2022 Published by Elsevier Ltd.-
dc.language영어-
dc.language.isoENG-
dc.publisherELSEVIER SCI LTD-
dc.titleTwo-stage generative adversarial networks for binarization of color document images-
dc.typeArticle-
dc.publisher.location영국-
dc.identifier.doi10.1016/j.patcog.2022.108810-
dc.identifier.scopusid2-s2.0-85131101041-
dc.identifier.wosid000808339300003-
dc.identifier.bibliographicCitationPATTERN RECOGNITION, v.130-
dc.citation.titlePATTERN RECOGNITION-
dc.citation.volume130-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.subject.keywordPlusENHANCEMENT-
dc.subject.keywordAuthorDocument image binarization-
dc.subject.keywordAuthorGenerative adversarial networks-
dc.subject.keywordAuthorOptical character recognition-
dc.subject.keywordAuthorColor document image enhancement-
Files in This Item
There are no files associated with this item.
Appears in
Collections
ETC > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Lee, Yong Oh photo

Lee, Yong Oh
Engineering (Department of Industrial and Data Engineering)
Read more

Altmetrics

Total Views & Downloads

BROWSE