Snip-Cache: A code snippet caching system for LLM-based command-driven IoT systems
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Song, Chiwon | - |
| dc.contributor.author | Kang, Sooyong | - |
| dc.date.accessioned | 2026-03-25T05:30:46Z | - |
| dc.date.available | 2026-03-25T05:30:46Z | - |
| dc.date.issued | 2026-03 | - |
| dc.identifier.issn | 2543-1536 | - |
| dc.identifier.issn | 2542-6605 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/211569 | - |
| dc.description.abstract | Large language models (LLMs) are widely used in real-time interface systems that process user commands. Despite their high output quality, the long response times and substantial operating costs undermine the practicality and sustainability of LLM-based services. Prompt caching is one of the optimization techniques introduced to mitigate the problem. It avoids redundant processing of repetitive prompts by caching and reusing the response for the same or similar prompts. However, such a static caching scheme has an intrinsic limitation, in terms of the reusability of results, due to the variety of expressions having the same semantics in real-world usage environments. In this paper, we introduce a new strategy for prompt caching, Snippet Caching, for LLM-based command-driven IoT systems to overcome the limitation. It perceives a command (prompt) as a function call with specific arguments. Instead of caching (input, output) pairs, it caches two simple code snippets that mimic LLM operations for each function. Based on the strategy, we design a novel prompt caching scheme, Snip-Cache, which generates code snippets with the help of LLMs. Experimental results show that Snip-Cache is significantly more beneficial to command-driven IoT systems than semantic caching schemes (GPTCache and vCache), in terms of response accuracy, response time, and token usage. | - |
| dc.format.extent | 24 | - |
| dc.language | English | - |
| dc.language.iso | ENG | - |
| dc.publisher | Elsevier B.V. | - |
| dc.title | Snip-Cache: A code snippet caching system for LLM-based command-driven IoT systems | - |
| dc.type | Article | - |
| dc.publisher.location | Netherlands | - |
| dc.identifier.doi | 10.1016/j.iot.2025.101852 | - |
| dc.identifier.scopusid | 2-s2.0-105024911075 | - |
| dc.identifier.wosid | 001643716800001 | - |
| dc.identifier.bibliographicCitation | Internet of Things, v.36, pp 1 - 24 | - |
| dc.citation.title | Internet of Things | - |
| dc.citation.volume | 36 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 24 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Telecommunications | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Telecommunications | - |
| dc.subject.keywordAuthor | Command-driven system | - |
| dc.subject.keywordAuthor | IoT system | - |
| dc.subject.keywordAuthor | LLM | - |
| dc.subject.keywordAuthor | Prompt caching | - |
| dc.subject.keywordAuthor | Semantic caching | - |
| dc.identifier.url | https://www.sciencedirect.com/science/article/pii/S254266052500366X?via%3Dihub | - |
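
The abstract describes treating a command as a function call with arguments and caching two code snippets per function instead of (input, output) pairs. The record does not say what the two snippets are, so the following is only a minimal sketch under stated assumptions: one snippet is assumed to extract arguments from the command text and the other to build the response from those arguments. All names here (`SnippetEntry`, `SnippetCache`, `extract_set_temperature`, `build_set_temperature`, the regular expression, and the response fields) are hypothetical and are not taken from the paper. In Snip-Cache the snippets are generated with the help of LLMs; here they are hand-written for illustration.

```python
import re
from dataclasses import dataclass
from typing import Callable, Dict, Optional


@dataclass
class SnippetEntry:
    # Assumed snippet 1: pull arguments out of a command; None means "not my function".
    extract_args: Callable[[str], Optional[dict]]
    # Assumed snippet 2: produce the response the LLM would have produced.
    build_response: Callable[[dict], dict]


class SnippetCache:
    """Hypothetical cache keyed by function name rather than by prompt text."""

    def __init__(self) -> None:
        self.entries: Dict[str, SnippetEntry] = {}

    def put(self, function_name: str, entry: SnippetEntry) -> None:
        self.entries[function_name] = entry

    def lookup(self, command: str) -> Optional[dict]:
        # Try each cached function; the first snippet that parses the command wins.
        for entry in self.entries.values():
            args = entry.extract_args(command)
            if args is not None:
                return entry.build_response(args)
        return None  # Cache miss: the caller would fall back to the LLM.


# Hand-written stand-ins for snippets that an LLM would generate (illustrative only).
_SET_TEMP = re.compile(
    r"set (?:the )?(?P<room>\w+) (?:temperature|thermostat) to (?P<value>\d+)",
    re.IGNORECASE,
)


def extract_set_temperature(command: str) -> Optional[dict]:
    match = _SET_TEMP.search(command)
    if match is None:
        return None
    return {"room": match.group("room").lower(), "value": int(match.group("value"))}


def build_set_temperature(args: dict) -> dict:
    return {
        "device": f"{args['room']}_thermostat",
        "action": "set_temperature",
        "value": args["value"],
    }


cache = SnippetCache()
cache.put("set_temperature", SnippetEntry(extract_set_temperature, build_set_temperature))

hit = cache.lookup("Set the bedroom temperature to 22")
other_hit = cache.lookup("Please set kitchen thermostat to 19 degrees")
miss = cache.lookup("Turn on the lights")
```

In this sketch, two differently worded commands with different arguments are both served by the same cached entry, which is the kind of reuse an (input, output) cache cannot provide; an unrecognized command returns `None` and would be forwarded to the LLM.
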
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
