An Intelligent Docent System with a Small Large Language Model (sLLM) Based on Retrieval-Augmented Generation (RAG)

Jung, Taemoon; Joe, Inwhee

doi:10.3390/app15179398

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

An Intelligent Docent System with a Small Large Language Model (sLLM) Based on Retrieval-Augmented Generation (RAG)

Full metadata record

DC Field	Value	Language
dc.contributor.author	Jung, Taemoon	-
dc.contributor.author	Joe, Inwhee	-
dc.date.accessioned	2025-10-02T01:30:31Z	-
dc.date.available	2025-10-02T01:30:31Z	-
dc.date.issued	2025-08	-
dc.identifier.issn	2076-3417	-
dc.identifier.issn	2076-3417	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/208858	-
dc.description.abstract	This study designed and empirically evaluated a method to enhance information accessibility for museum and art gallery visitors using a small Large Language Model (sLLM) based on the Retrieval-Augmented Generation (RAG) framework. Over 199,000 exhibition descriptions were collected and refined, and a question-answering dataset consisting of 102,000 pairs reflecting user personas was constructed to develop DocentGemma, a domain-optimized language model. This model was fine-tuned through Low-Rank Adaptation (LoRA) based on Google's Gemma2-9B and integrated with FAISS and OpenSearch-based document retrieval systems within the LangChain framework. Performance evaluation was conducted using a dedicated Q&A benchmark for the docent domain, comparing the model against five commercial and open-source LLMs (including GPT-3.5 Turbo, LLaMA3.3-70B, and Gemma2-9B). DocentGemma achieved an accuracy of 85.55% and a perplexity of 3.78, demonstrating competitive performance in language generation and response accuracy within the domain-specific context. To enhance retrieval relevance, a Spatio-Contextual Retriever (SC-Retriever) was introduced, which combines semantic similarity and spatial proximity based on the user's query and location. An ablation study confirmed that integrating both modalities improved retrieval quality, with the SC-Retriever achieving a recall@1 of 53.45% and a Mean Reciprocal Rank (MRR) of 68.12, representing a 17.5 20% gain in search accuracy compared to baseline models such as GTE and SpatialNN. System performance was further validated through field deployment at three major exhibition venues in Seoul (the Seoul History Museum, the Hwan-ki Museum, and the Hanseong Baekje Museum). A user test involving 110 participants indicated high response credibility and an average satisfaction score of 4.24. To ensure accessibility, the system supports various output formats, including multilingual speech and subtitles. This work illustrates a practical application of integrating LLM-based conversational capabilities into traditional docent services and suggests potential for further development toward location-aware interactive systems and AI-driven cultural content services.	-
dc.format.extent	30	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	MDPI	-
dc.title	An Intelligent Docent System with a Small Large Language Model (sLLM) Based on Retrieval-Augmented Generation (RAG)	-
dc.type	Article	-
dc.publisher.location	스위스	-
dc.identifier.doi	10.3390/app15179398	-
dc.identifier.scopusid	2-s2.0-105015490312	-
dc.identifier.wosid	001569574300001	-
dc.identifier.bibliographicCitation	Applied Sciences-basel, v.15, no.17, pp 1 - 30	-
dc.citation.title	Applied Sciences-basel	-
dc.citation.volume	15	-
dc.citation.number	17	-
dc.citation.startPage	1	-
dc.citation.endPage	30	-
dc.type.docType	Article	-
dc.description.isOpenAccess	Y	-
dc.description.journalRegisteredClass	scie	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Chemistry	-
dc.relation.journalResearchArea	Engineering	-
dc.relation.journalResearchArea	Materials Science	-
dc.relation.journalResearchArea	Physics	-
dc.relation.journalWebOfScienceCategory	Chemistry, Multidisciplinary	-
dc.relation.journalWebOfScienceCategory	Engineering, Multidisciplinary	-
dc.relation.journalWebOfScienceCategory	Materials Science, Multidisciplinary	-
dc.relation.journalWebOfScienceCategory	Physics, Applied	-
dc.subject.keywordPlus	TEXT-TO-SPEECH	-
dc.subject.keywordAuthor	small Large Language Model (sLLM)	-
dc.subject.keywordAuthor	Retrieval-Augmented Generation (RAG)	-
dc.subject.keywordAuthor	Intelligent Docent	-
dc.subject.keywordAuthor	Museum	-
dc.subject.keywordAuthor	LangChain	-
dc.subject.keywordAuthor	LoRA	-
dc.subject.keywordAuthor	HuggingFace	-
dc.subject.keywordAuthor	Natural Language Processing model (NLP)	-
dc.subject.keywordAuthor	Speech-to-Text (STT)	-
dc.subject.keywordAuthor	Ultra-Wideband (UWB)	-
dc.subject.keywordAuthor	Facebook AI Similarity Search (FAISS)	-
dc.identifier.url	https://www.mdpi.com/2076-3417/15/17/9398	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Show simple item record

qrcode

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Altmetrics

Total Views & Downloads

BROWSE