Detailed Information

Cited 8 time in webofscience Cited 13 time in scopus
Metadata Downloads

Multimodal Neural Machine Translation With Weakly Labeled Images

Full metadata record
DC Field Value Language
dc.contributor.authorHeo, Yoonseok-
dc.contributor.authorKang, Sangwoo-
dc.contributor.authorYoo, Donghyun-
dc.date.available2020-02-27T07:42:55Z-
dc.date.created2020-02-05-
dc.date.issued2019-
dc.identifier.issn2169-3536-
dc.identifier.urihttps://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/2873-
dc.description.abstractMachine translation refers to a fully automated process that translates a user's input text into a target language. To improve the accuracy of machine translation, studies usually exploit not only the input text itself but also various background knowledge related to the text, such as visual information or prior knowledge. Herein, in this paper, we propose a multimodal neural machine translation system that uses both texts and their related images to translate Korean image captions into English. The data in the experiment is a set of unlabeled images only containing bilingual captions. To train the system with a supervised learning approach, we propose a weak-labeling method that selects a keyword from an image caption using feature selection methods. The keywords are used to roughly determine an image label. We also introduce an improved feature selection method using sentence clustering to select keywords that reflect the characteristics of the image captions more accurately. We found that our multimodal system achieves an improved performance compared to a text-only neural machine translation system (baseline). Furthermore, the additional images have positive impacts on addressing the issue of under-translation, where some words in a source sentence are falsely translated or not translated at all.-
dc.language영어-
dc.language.isoen-
dc.publisherIEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC-
dc.relation.isPartOfIEEE ACCESS-
dc.subjectMODEL-
dc.titleMultimodal Neural Machine Translation With Weakly Labeled Images-
dc.typeArticle-
dc.type.rimsART-
dc.description.journalClass1-
dc.identifier.wosid000467272500001-
dc.identifier.doi10.1109/ACCESS.2019.2911656-
dc.identifier.bibliographicCitationIEEE ACCESS, v.7, pp.54042 - 54053-
dc.identifier.scopusid2-s2.0-85065386510-
dc.citation.endPage54053-
dc.citation.startPage54042-
dc.citation.titleIEEE ACCESS-
dc.citation.volume7-
dc.contributor.affiliatedAuthorKang, Sangwoo-
dc.type.docTypeArticle-
dc.subject.keywordAuthorHuman-computer interaction-
dc.subject.keywordAuthormulti-layer neural network-
dc.subject.keywordAuthornatural language processing-
dc.subject.keywordAuthorimage classification-
dc.subject.keywordAuthormultimodal neural machine translation-
dc.subject.keywordAuthorweak label-
dc.subject.keywordPlusMODEL-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalResearchAreaEngineering-
dc.relation.journalResearchAreaTelecommunications-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryEngineering, Electrical & Electronic-
dc.relation.journalWebOfScienceCategoryTelecommunications-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
IT융합대학 > 소프트웨어학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kang, Sang Woo photo

Kang, Sang Woo
College of IT Convergence (Department of Software)
Read more

Altmetrics

Total Views & Downloads

BROWSE