Detailed Information

Cited 7 time in webofscience Cited 13 time in scopus
Metadata Downloads

Charon: Specialized near-memory processing architecture for clearing dead objects in memory

Authors
Jang, J.[Jang, J.]Heo, J.[Heo, J.]Lee, Y.[Lee, Y.]Won, J.[Won, J.]Kim, S.[Kim, S.]Jung, S.J.[Jung, S.J.]Jang, H.[Jang, H.]Ham, T.J.[Ham, T.J.]Lee, J.W.[Lee, J.W.]
Issue Date
2019
Publisher
IEEE Computer Society
Keywords
Domain-specific architecture; Garbage collection; Java virtual machine; Memory management; Near-memory processing
Citation
Proceedings of the Annual International Symposium on Microarchitecture, MICRO, pp.726 - 739
Indexed
SCOPUS
Journal Title
Proceedings of the Annual International Symposium on Microarchitecture, MICRO
Start Page
726
End Page
739
URI
https://scholarworks.bwise.kr/skku/handle/2021.sw.skku/13873
DOI
10.1145/3352460.3358297
ISSN
1072-4451
Abstract
programming, saving a programmer from many nasty memoryrelated bugs. However, these productivity benefits come with a cost in terms of application throughput, worst-case latency, and energy consumption. Since the first introduction of GC by the Lisp programming language in the 1950s, a myriad of hardware and software techniques have been proposed to reduce this cost. While the idea of accelerating GC in hardware is appealing, its impact has been very limited due to narrow coverage, lack of flexibility, intrusive system changes, and significant hardware cost. Even with specialized hardware GC performance is eventually limited by memory bandwidth bottleneck. Fortunately, emerging 3D stacked DRAM technologies shed new light on this decades-old problem by enabling efficient near-memory processing with ample memory bandwidth. Thus, we propose Charon1, the first 3D stacked memory-based GC accelerator. Through a detailed performance analysis of HotSpot JVM, we derive a set of key algorithmic primitives based on their GC time coverage and implementation complexity in hardware. Then we devise a specialized processing unit to substantially improve their memory-level parallelism and throughput with a low hardware cost. Our evaluation of Charon with the full-production HotSpot JVM running two big data analytics frameworks, Spark and GraphChi, demonstrates a 3.29 geomean speedup and 60.7% energy savings for GC over the baseline 8-core out-of-order processor. © 2019 Association for Computing Machinery.
Files in This Item
There are no files associated with this item.
Appears in
Collections
Information and Communication Engineering > School of Electronic and Electrical Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE