Hardware-software Co-design for Vector Similarity Search on HBM-PIM

Full metadata record
DC Field | Value | Language
dc.contributor.author | Kim, Nahyeon | -
dc.contributor.author | Kim, Sujin | -
dc.contributor.author | Jung, Min | -
dc.contributor.author | Noh, Haechannuri | -
dc.contributor.author | Kim, Ji-Hoon | -
dc.date.accessioned | 2026-01-19T06:00:24Z | -
dc.date.available | 2026-01-19T06:00:24Z | -
dc.date.issued | 2025-12 | -
dc.identifier.issn | 1598-1657 | -
dc.identifier.issn | 2233-4866 | -
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210365 | -
dc.description.abstract | Vector similarity search is a key component of Retrieval-Augmented Generation (RAG) for large language models (LLMs), requiring memory-intensive computations such as Manhattan distance, Euclidean distance, and cosine similarity. Processing-In-Memory (PIM) architectures offer a promising solution to accelerate these memory-bound operations by reducing data movement between memory and processor. This study presents a hardware-software co-design approach for optimizing distance computation on PIM. We first implemented and evaluated a vector similarity search application on a DRAM-based PIM platform using the developed computation library, achieving 44.2% and 59.0% speed improvements for Euclidean distance and cosine similarity, respectively, compared to the CPU. However, instruction set limitations led to performance bottlenecks despite software-level optimization. To address this, we utilized an HBM-based PIM simulator and proposed two new instructions, AMC and MAN, optimized for Euclidean and Manhattan distance computations. Evaluation using a simulator integrated with DRAMSim2 showed that the proposed instructions reduced the total cycle count for distance computations by up to 44% compared to the baseline, with performance gains increasing for larger input sizes. These results demonstrate that both software-level and instruction-level optimizations are essential to fully exploit the performance potential of PIM architectures for distance computation workloads. | -
dc.format.extent | 8 | -
dc.language | English | -
dc.language.iso | ENG | -
dc.publisher | IEEK PUBLICATION CENTER | -
dc.title | Hardware-software Co-design for Vector Similarity Search on HBM-PIM | -
dc.type | Article | -
dc.publisher.location | Republic of Korea | -
dc.identifier.doi | 10.5573/JSTS.2025.25.6.662 | -
dc.identifier.scopusid | 2-s2.0-105026859555 | -
dc.identifier.wosid | 001647850800005 | -
dc.identifier.bibliographicCitation | JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, v.25, no.6, pp 662 - 669 | -
dc.citation.title | JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE | -
dc.citation.volume | 25 | -
dc.citation.number | 6 | -
dc.citation.startPage | 662 | -
dc.citation.endPage | 669 | -
dc.identifier.kciid | ART003274361 | -
dc.description.isOpenAccess | N | -
dc.description.journalRegisteredClass | scie | -
dc.description.journalRegisteredClass | scopus | -
dc.description.journalRegisteredClass | kci | -
dc.relation.journalResearchArea | Engineering | -
dc.relation.journalResearchArea | Physics | -
dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | -
dc.relation.journalWebOfScienceCategory | Physics, Applied | -
dc.subject.keywordPlus | Application programs | -
dc.subject.keywordPlus | Data reduction | -
dc.subject.keywordPlus | Dynamic random access storage | -
dc.subject.keywordPlus | Integrated circuit design | -
dc.subject.keywordPlus | Memory architecture | -
dc.subject.keywordPlus | Software design | -
dc.subject.keywordPlus | Vectors | -
dc.subject.keywordAuthor | Processing-in-memory (PIM) | -
dc.subject.keywordAuthor | retrieval-augmented generation (RAG) | -
dc.subject.keywordAuthor | vector similarity search | -
dc.subject.keywordAuthor | distance computation | -
dc.subject.keywordAuthor | instruction set extension | -
dc.subject.keywordAuthor | hardware-software co-design | -
dc.subject.keywordAuthor | PIM simulator | -
dc.identifier.url | https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12497868&buildDate=2026-01-06+10%3A22%3A39&nowDate=20260106_2&cdnUrl=https%3A%2F%2Fcdn.dbpia.co.kr%2Fstatic&appVersion=1.0.0&buildTime=20260106102239&minify=.min&language=ko_KR&hasTopBanner=true | -
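
Illustrative note: the abstract names three distance computations (Manhattan distance, Euclidean distance, and cosine similarity) as the memory-bound kernels of vector similarity search. The record does not include the authors' PIM computation library or the proposed AMC and MAN instructions, so the following is only a minimal CPU-side sketch in plain Python of what those kernels compute. All function names here (`manhattan`, `euclidean`, `cosine_similarity`, `nearest`) are hypothetical and chosen for illustration.

```python
import math


def manhattan(a, b):
    """Sum of absolute element-wise differences (L1 distance)."""
    return sum(abs(x - y) for x, y in zip(a, b))


def euclidean(a, b):
    """Square root of the sum of squared differences (L2 distance)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_similarity(a, b):
    """Dot product divided by the product of the two vector norms."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


def nearest(query, database, k=1, metric=euclidean):
    """Brute-force search: indices of the k vectors with the smallest distance.

    Every database vector is read once per query, which is why this
    workload is dominated by memory traffic rather than arithmetic.
    Assumes all vectors have the same length as the query.
    """
    order = sorted(range(len(database)), key=lambda i: metric(query, database[i]))
    return order[:k]
```

For example, `nearest([0, 0], [[5, 5], [1, 1], [3, 3]], k=2)` ranks the three stored vectors by Euclidean distance to the origin and returns the indices of the two closest. To rank by cosine similarity with this helper, one would pass a metric such as `lambda q, v: -cosine_similarity(q, v)`, because higher similarity means a closer match.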
Appears in Collections
Seoul College of Engineering > Seoul School of Electronic Engineering > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Kim, Ji Hoon
COLLEGE OF ENGINEERING (SCHOOL OF ELECTRONIC ENGINEERING)