SmCompactor: A workload-aware fine-grained resource management framework for GPGPUs
- Authors
- Chen, Q.; Chung, H.; Son, Y.; Kim, Y.; Yeom, H.Y.
- Issue Date
- Mar-2021
- Publisher
- Association for Computing Machinery
- Keywords
- GPU multitasking; GPU resource management; HPC; OS; parallel computing
- Citation
- Proceedings of the ACM Symposium on Applied Computing, pp 1147 - 1155
- Pages
- 9
- Journal Title
- Proceedings of the ACM Symposium on Applied Computing
- Start Page
- 1147
- End Page
- 1155
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/54361
- DOI
- 10.1145/3412841.3441989
- ISSN
- 0000-0000
- Abstract
- Recently, graphic processing unit (GPU) multitasking has become important in many platforms since an efficient GPU multitasking mechanism can enable more GPU-enabled tasks running on limited physical GPUs. However, current GPU multitasking technologies, such as NVIDIA Multi-Process Service (MPS) and Hyper-Q may not fully utilize GPU resources since they do not consider the efficient use of intra-GPU resources. In this paper, we present smCompactor, which is a fine-grained GPU multitasking framework to fully exploit intra-GPU resources for different workloads. smCompactor dispatches any particular thread blocks (TBs) of different GPU kernels to appropriate stream multiprocessors (SMs) based on our profiled results of workloads. With smCompactor, GPU resource utilization can be improved as we can run more workloads on a single GPU while their performance is maintained. The evaluation results show that smCompactor improves resource utilization in terms of the number of active SMs by up to 33% and it reduces the kernel execution time by up to 26% compared with NVIDIA MPS.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Software > School of Computer Science and Engineering > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.