Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Efficient GPU multitasking with latency minimization and cache boostingopen access

Authors
Kim, JihoChu, MinsungPark, Yongjun
Issue Date
Apr-2017
Publisher
IEICE-INST ELECTRONICS INFORMATION COMMUNICATIONS ENG
Keywords
GPGPU; multitasking; energy; resource sharing; workload balancing
Citation
IEICE ELECTRONICS EXPRESS, v.14, no.7, pp.1 - 12
Indexed
SCIE
SCOPUS
Journal Title
IEICE ELECTRONICS EXPRESS
Volume
14
Number
7
Start Page
1
End Page
12
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/152495
DOI
10.1587/elex.14.20161158
ISSN
1349-2543
Abstract
GPU spatial multitasking has been proven to be quite effective at executing different applications concurrently using SM partitioning. However, while it maximizes total throughput, latency-critical applications often cannot meet their deadlines due to the increased execution time. Furthermore, SM partitioning cannot allocate the appropriate L1 cache size per kernel. To solve these problems, this paper proposes a new application-aware resource allocation framework called GPU Fine-Tuner, for assigning appropriate resources to GPU kernels. To minimize the execution time of latency-constrained applications, it assigns them more SMs when performance is not affected. It also increases the cache size of SMs for cache-sensitive kernels using resource borrowing from neighbors for cache-insensitive kernels. Experimental results show that the Fine-Tuner outperforms GPU spatial multitasking with up to 15% less average latency without performance degradation.
Files in This Item
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Park, Yong jun photo

Park, Yong jun
서울 공과대학 (서울 컴퓨터소프트웨어학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE