Hybrid Embedding Framework for Memory-Efficient Recommendation Systems
- Authors
- Yang, Seung Jin; Lee, Hyuk Jae; Rhee, Chae Eun
- Issue Date
- Sep-2025
- Citation
- Proceedings - Design Automation Conference, pp 1 - 7
- Pages
- 7
- Indexed
- SCOPUS
- Journal Title
- Proceedings - Design Automation Conference
- Start Page
- 1
- End Page
- 7
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/209094
- DOI
- 10.1109/DAC63849.2025.11132826
- ISSN
- 0738-100X
0146-7123
- Abstract
- This study introduces a memory-efficient mixed representation for deep learning recommendation models (DLRM), addressing the embedding table memory bottleneck from growing data scale. By distinguishing between frequently accessed (hot) and infrequently accessed (cold) embeddings, we store hot embeddings in a compact table while representing cold embeddings using a deep hash embedding (DHE) network, significantly reducing memory usage. This hybrid approach performs table lookups for hot embeddings and parallelized computations for cold embeddings, minimizing training time while maintaining accuracy. Experimental results demonstrate that our method outperforms other embedding reduction techniques in memory efficiency, accuracy, and training speed in CPU-GPU hybrid environments. © 2025 Elsevier B.V., All rights reserved.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 융합전자공학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.