Cited 0 time in
Hardware-software Co-design for Vector Similarity Search on HBM-PIM
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Kim, Nahyeon | - |
| dc.contributor.author | Kim, Sujin | - |
| dc.contributor.author | Jung, Min | - |
| dc.contributor.author | Noh, Haechannuri | - |
| dc.contributor.author | Kim, Ji-Hoon | - |
| dc.date.accessioned | 2026-01-19T06:00:24Z | - |
| dc.date.available | 2026-01-19T06:00:24Z | - |
| dc.date.issued | 2025-12 | - |
| dc.identifier.issn | 1598-1657 | - |
| dc.identifier.issn | 2233-4866 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/210365 | - |
| dc.description.abstract | Vector similarity search is a key component of Retrieval-Augmented Generation (RAG) for large language models (LLMs), requiring memory-intensive computations such as Manhattan distance, Euclidean distance, and cosine similarity. Processing-In-Memory (PIM) architectures offer a promising solution to accelerate these memory-bound operations by reducing data movement between memory and processor. This study presents a hardware-software co-design approach for optimizing distance computation on PIM. We first implemented and evaluated a vector similarity search application on a DRAM-based PIM platform using the developed computation library, achieving 44.2% and 59.0% speed improvements for Euclidean distance and cosine similarity, respectively, compared to the CPU. However, instruction set limitations led to performance bottlenecks despite software-level optimization. To address this, we utilized an HBM-based PIM simulator and proposed two new instructions, AMC and MAN, optimized for Euclidean and Manhattan distance computations. Evaluation using a simulator integrated with DRAMSim2 showed that the proposed instructions reduced the total cycle count for distance computations by up to 44% compared to the baseline, with performance gains increasing for larger input sizes. These results demonstrate that both software-level and instruction-level optimizations are essential to fully exploit the performance potential of PIM architectures for distance computation workloads. | - |
| dc.format.extent | 8 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | IEEK PUBLICATION CENTER | - |
| dc.title | Hardware-software Co-design for Vector Similarity Search on HBM-PIM | - |
| dc.type | Article | - |
| dc.publisher.location | 대한민국 | - |
| dc.identifier.doi | 10.5573/JSTS.2025.25.6.662 | - |
| dc.identifier.scopusid | 2-s2.0-105026859555 | - |
| dc.identifier.wosid | 001647850800005 | - |
| dc.identifier.bibliographicCitation | JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE, v.25, no.6, pp 662 - 669 | - |
| dc.citation.title | JOURNAL OF SEMICONDUCTOR TECHNOLOGY AND SCIENCE | - |
| dc.citation.volume | 25 | - |
| dc.citation.number | 6 | - |
| dc.citation.startPage | 662 | - |
| dc.citation.endPage | 669 | - |
| dc.identifier.kciid | ART003274361 | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.description.journalRegisteredClass | kci | - |
| dc.relation.journalResearchArea | Engineering | - |
| dc.relation.journalResearchArea | Physics | - |
| dc.relation.journalWebOfScienceCategory | Engineering, Electrical & Electronic | - |
| dc.relation.journalWebOfScienceCategory | Physics, Applied | - |
| dc.subject.keywordPlus | Application programs | - |
| dc.subject.keywordPlus | Data reduction | - |
| dc.subject.keywordPlus | Dynamic random access storage | - |
| dc.subject.keywordPlus | Integrated circuit design | - |
| dc.subject.keywordPlus | Memory architecture | - |
| dc.subject.keywordPlus | Software design | - |
| dc.subject.keywordPlus | Vectors | - |
| dc.subject.keywordAuthor | Processing-in-memory (PIM) | - |
| dc.subject.keywordAuthor | retrieval-augmented generation (RAG) | - |
| dc.subject.keywordAuthor | vector similarity search | - |
| dc.subject.keywordAuthor | distance computation | - |
| dc.subject.keywordAuthor | instruction set extension | - |
| dc.subject.keywordAuthor | hardware-software co-design | - |
| dc.subject.keywordAuthor | PIM simulator | - |
| dc.identifier.url | https://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE12497868&buildDate=2026-01-06+10%3A22%3A39&nowDate=20260106_2&cdnUrl=https%3A%2F%2Fcdn.dbpia.co.kr%2Fstatic&appVersion=1.0.0&buildTime=20260106102239&minify=.min&language=ko_KR&hasTopBanner=true | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
