KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking

Kim, Juyeon; Lee, Geon; Kim, Taeuk; Shin, Kijung

doi:10.1145/3726302.3730217

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kim, Juyeon	-
dc.contributor.author	Lee, Geon	-
dc.contributor.author	Kim, Taeuk	-
dc.contributor.author	Shin, Kijung	-
dc.date.accessioned	2025-09-10T00:30:31Z	-
dc.date.available	2025-09-10T00:30:31Z	-
dc.date.issued	2025-07	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208697	-
dc.description.abstract	Entity linking (EL) aligns textual mentions with their corresponding entities in a knowledge base, facilitating various applications such as semantic search and question answering. Recent advances in multimodal entity linking (MEL) have shown that combining text and images can reduce ambiguity and improve alignment accuracy. However, most existing MEL methods overlook the rich structural information available in the form of knowledge-graph (KG) triples. In this paper, we propose KGMEL, a novel framework that leverages KG triples to enhance MEL. Specifically, it operates in three stages: (1) Generation: Produces high-quality triples for each mention by employing vision-language models based on its text and images. (2) Retrieval: Learns joint mention-entity representations, via contrastive learning, that integrate text, images, and (generated or KG) triples to retrieve candidate entities for each mention. (3) Reranking: Refines the KG triples of the candidate entities and employs large language models to identify the best-matching entity for the mention. Extensive experiments on benchmark datasets demonstrate that KGMEL outperforms existing methods. Our code, datasets, and online appendix are available at: https://github.com/juyeonnn/KGMEL.	-
dc.format.extent	5	-
dc.language	English	-
dc.language.iso	ENG	-
dc.publisher	Association for Computing Machinery, Inc	-
dc.title	KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking	-
dc.type	Article	-
dc.identifier.doi	10.1145/3726302.3730217	-
dc.identifier.scopusid	2-s2.0-105011821176	-
dc.identifier.wosid	001587983900333	-
dc.identifier.bibliographicCitation	SIGIR 2025 - Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 3015 - 3019	-
dc.citation.title	SIGIR 2025 - Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval	-
dc.citation.startPage	3015	-
dc.citation.endPage	3019	-
dc.type.docType	Proceedings Paper	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Artificial Intelligence	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Computer Science, Theory & Methods	-
dc.subject.keywordPlus	Computational linguistics	-
dc.subject.keywordPlus	Computer vision	-
dc.subject.keywordPlus	Knowledge graph	-
dc.subject.keywordPlus	Knowledge management	-
dc.subject.keywordPlus	Learning systems	-
dc.subject.keywordPlus	Natural language processing systems	-
dc.subject.keywordPlus	Visual languages	-
dc.subject.keywordAuthor	Knowledge Graph	-
dc.subject.keywordAuthor	Multimodal Entity Linking	-
dc.subject.keywordAuthor	Multimodal Knowledge Base	-
dc.subject.keywordAuthor	Vision Language Models	-
dc.subject.keywordAuthor	Computational Linguistics	-
dc.subject.keywordAuthor	Computer Vision	-
dc.subject.keywordAuthor	Knowledge Management	-
dc.subject.keywordAuthor	Learning Systems	-
dc.subject.keywordAuthor	Natural Language Processing Systems	-
dc.subject.keywordAuthor	Visual Languages	-
dc.subject.keywordAuthor	Alignment Accuracy	-
dc.subject.keywordAuthor	Knowledge Graphs	-
dc.subject.keywordAuthor	Language Model	-
dc.subject.keywordAuthor	Multi-modal	-
dc.subject.keywordAuthor	Question Answering	-
dc.subject.keywordAuthor	Semantic Search	-
dc.subject.keywordAuthor	Vision Language Model	-
dc.subject.keywordAuthor	Semantics	-
dc.identifier.url	https://dl.acm.org/doi/10.1145/3726302.3730217	-
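
The Retrieval stage described in the abstract (joint mention-entity representations learned with contrastive learning, then used to retrieve candidates) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not specify the encoders, the fusion rule, or the exact loss, so this uses plain averaging of text, image, and triple vectors and a generic in-batch InfoNCE loss. The names `fuse`, `info_nce`, and `retrieve` are hypothetical and are not taken from the KGMEL code.

```python
import math

def l2_normalize(v):
    """Scale a vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v] if norm > 0 else list(v)

def fuse(text, image, triples):
    """Assumed fusion rule: average the three modality vectors, then normalize."""
    return l2_normalize([(t + i + k) / 3.0 for t, i, k in zip(text, image, triples)])

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def info_nce(mentions, entities, temperature=0.1):
    """Generic in-batch contrastive loss: mentions[i] is the positive for entities[i]."""
    total = 0.0
    for i, m in enumerate(mentions):
        logits = [dot(m, e) / temperature for e in entities]
        peak = max(logits)  # subtract the max for a numerically stable log-sum-exp
        log_denom = peak + math.log(sum(math.exp(z - peak) for z in logits))
        total += log_denom - logits[i]
    return total / len(mentions)

def retrieve(mention, entities, k=2):
    """Return indices of the k entities most similar to the mention."""
    scores = [dot(mention, e) for e in entities]
    return sorted(range(len(entities)), key=lambda j: scores[j], reverse=True)[:k]

# Toy 2-dimensional embeddings standing in for encoder outputs.
entities = [fuse([1, 0], [1, 0], [1, 0]), fuse([0, 1], [0, 1], [0, 1])]
mentions = [fuse([1, 0], [0.9, 0.1], [1, 0]), fuse([0, 1], [0.1, 0.9], [0, 1])]
top1 = retrieve(mentions[0], entities, k=1)
aligned_loss = info_nce(mentions, entities)
swapped_loss = info_nce(mentions[::-1], entities)
```

In this toy setup the loss is lower when each mention is paired with its own entity than when the pairs are swapped, which is the property a contrastive objective is meant to enforce. The candidates returned by `retrieve` would then be passed to a reranking step, which the abstract says is done with large language models.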

Appears in Collections: Seoul College of Engineering > Seoul School of Computer Software > 1. Journal Articles

Related Researcher

Kim, Taeuk: COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
