Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Image Manipulation Using Korean Translation and CLIP: Ko-CLIP

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Sieun-
dc.contributor.authorJoe, Inwhee-
dc.date.accessioned2023-11-14T08:26:30Z-
dc.date.available2023-11-14T08:26:30Z-
dc.date.created2023-10-11-
dc.date.issued2023-04-
dc.identifier.issn2367-3370-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/192229-
dc.description.abstractDeep Learning, a field of artistic intelligence (AI), is showing good results in natural language processing (NLP) and image processing classification. In the NLP field, in particular, the BERT-based model has become the main focus of the latest language model. It is a representative model that utilizes BERT pre-training and fine-tuning. Through the process of pre-training vast amounts of data and fine-tuning it, more natural NLP can be implemented. CLIP recently built a dataset with only web crawling without manual labeling to create a huge dataset that forms image-text pairs. With the CLIP Model, it tells you which image the input text is deeply related to. However, CLIP does not recognize Korean text when it is input, so it cannot accurately analyze it. In this paper, we propose to use the BERT Model of NLP and CLIP in the field of image processing to process images by receiving Korean text input. The Korean text is translated into English through the BERT Model and used as input text in the CLIP Model. The output that went through the two models reflected the contents of the Korean text. It can be seen that Output is related to the accuracy of Korean text.-
dc.language영어-
dc.language.isoen-
dc.publisherSpringer Science and Business Media Deutschland GmbH-
dc.titleImage Manipulation Using Korean Translation and CLIP: Ko-CLIP-
dc.typeArticle-
dc.contributor.affiliatedAuthorJoe, Inwhee-
dc.identifier.doi10.1007/978-3-031-35314-7_21-
dc.identifier.scopusid2-s2.0-85172735362-
dc.identifier.bibliographicCitationLecture Notes in Networks and Systems, v.724 LNNS, pp.222 - 230-
dc.relation.isPartOfLecture Notes in Networks and Systems-
dc.citation.titleLecture Notes in Networks and Systems-
dc.citation.volume724 LNNS-
dc.citation.startPage222-
dc.citation.endPage230-
dc.type.rimsART-
dc.type.docTypeConference paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordPlusCharacter recognition-
dc.subject.keywordPlusDeep learning-
dc.subject.keywordPlusLearning algorithms-
dc.subject.keywordPlusNatural language processing systems-
dc.subject.keywordPlusTranslation (languages)-
dc.subject.keywordPlusWeb crawler-
dc.subject.keywordPlusComputer vision-
dc.subject.keywordAuthorComputer Vision-
dc.subject.keywordAuthorImage Processing-
dc.subject.keywordAuthorMachine Learning-
dc.subject.keywordAuthorNatural Language Processing-
dc.identifier.urlhttps://link.springer.com/chapter/10.1007/978-3-031-35314-7_21-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Joe, Inwhee photo

Joe, Inwhee
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE