Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Juyeon-
dc.contributor.authorLee, Geon-
dc.contributor.authorKim, Taeuk-
dc.contributor.authorShin, Kijung-
dc.date.accessioned2025-09-10T00:30:31Z-
dc.date.available2025-09-10T00:30:31Z-
dc.date.issued2025-07-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208697-
dc.description.abstractEntity linking (EL) aligns textual mentions with their corresponding entities in a knowledge base, facilitating various applications such as semantic search and question answering. Recent advances in multimodal entity linking (MEL) have shown that combining text and images can reduce ambiguity and improve alignment accuracy. However, most existing MEL methods overlook the rich structural information available in the form of knowledge-graph (KG) triples. In this paper, we propose KGMEL, a novel framework that leverages KG triples to enhance MEL. Specifically, it operates in three stages: (1) Generation: Produces high-quality triples for each mention by employing vision-language models based on its text and images. (2) Retrieval: Learns joint mention-entity representations, via contrastive learning, that integrate text, images, and (generated or KG) triples to retrieve candidate entities for each mention. (3) Reranking: Refines the KG triples of the candidate entities and employs large language models to identify the best-matching entity for the mention. Extensive experiments on benchmark datasets demonstrate that KGMEL outperforms existing methods. Our code, datasets, and online appendix are available at: https://github.com/juyeonnn/KGMEL.-
dc.format.extent5-
dc.language영어-
dc.language.isoENG-
dc.publisherAssociation for Computing Machinery, Inc-
dc.titleKGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking-
dc.typeArticle-
dc.identifier.doi10.1145/3726302.3730217-
dc.identifier.scopusid2-s2.0-105011821176-
dc.identifier.wosid001587983900333-
dc.identifier.bibliographicCitationSIGIR 2025 - Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 3015 - 3019-
dc.citation.titleSIGIR 2025 - Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval-
dc.citation.startPage3015-
dc.citation.endPage3019-
dc.type.docTypeProceedings Paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Artificial Intelligence-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.relation.journalWebOfScienceCategoryComputer Science, Theory & Methods-
dc.subject.keywordPlusComputational linguistics-
dc.subject.keywordPlusComputer vision-
dc.subject.keywordPlusKnowledge graph-
dc.subject.keywordPlusKnowledge management-
dc.subject.keywordPlusLearning systems-
dc.subject.keywordPlusNatural language processing systems-
dc.subject.keywordPlusVisual languages-
dc.subject.keywordAuthorKnowledge Graph-
dc.subject.keywordAuthorMultimodal Entity Linking-
dc.subject.keywordAuthorMultimodal Knowledge Base-
dc.subject.keywordAuthorVision Language Models-
dc.subject.keywordAuthorComputational Linguistics-
dc.subject.keywordAuthorComputer Vision-
dc.subject.keywordAuthorKnowledge Graph-
dc.subject.keywordAuthorKnowledge Management-
dc.subject.keywordAuthorLearning Systems-
dc.subject.keywordAuthorNatural Language Processing Systems-
dc.subject.keywordAuthorVisual Languages-
dc.subject.keywordAuthorAlignment Accuracy-
dc.subject.keywordAuthorKnowledge Graphs-
dc.subject.keywordAuthorLanguage Model-
dc.subject.keywordAuthorMulti-modal-
dc.subject.keywordAuthorMultimodal Entity Linking-
dc.subject.keywordAuthorMultimodal Knowledge Base-
dc.subject.keywordAuthorQuestion Answering-
dc.subject.keywordAuthorSemantic Search-
dc.subject.keywordAuthorVision Language Model-
dc.subject.keywordAuthorSemantics-
dc.identifier.urlhttps://dl.acm.org/doi/10.1145/3726302.3730217-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Taeuk photo

Kim, Taeuk
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE