Detailed Information

Cited 61 time in webofscience Cited 0 time in scopus
Metadata Downloads

Chimera: Collaborative Preemption for Multitasking on a Shared GPU

Authors
Park, Jason Jong KyuPark, YongjunMahlke, Scott
Issue Date
Apr-2015
Publisher
ASSOC COMPUTING MACHINERY
Keywords
Graphics Processing Unit; Preemptive Multitasking; Context Switch; Idempotence
Citation
ACM SIGPLAN NOTICES, v.50, no.4, pp.593 - 606
Journal Title
ACM SIGPLAN NOTICES
Volume
50
Number
4
Start Page
593
End Page
606
URI
https://scholarworks.bwise.kr/hongik/handle/2020.sw.hongik/10071
DOI
10.1145/2775054.2694346
ISSN
0362-1340
Abstract
The demand for multitasking on graphics processing units (GPUs) is constantly increasing as they have become one of the default components on modern computer systems along with traditional processors (CPUs). Preemptive multitasking on CPUs has been primarily supported through context switching. However, the same preemption strategy incurs substantial overhead due to the large context in GPUs. The overhead comes in two dimensions: a preempting kernel suffers from a long preemption latency, and the system throughput is wasted during the switch. Without precise control over the large preemption overhead, multitasking on GPUs has little use for applications with strict latency requirements. In this paper, we propose Chimera, a collaborative preemption approach that can precisely control the overhead for multitasking on GPUs. Chimera first introduces streaming multiprocessor (SM) flushing, which can instantly preempt an SM by detecting and exploiting idempotent execution. Chimera utilizes flushing collaboratively with two previously proposed preemption techniques for GPUs, namely context switching and draining to minimize throughput overhead while achieving a required preemption latency. Evaluations show that Chimera violates the deadline for only 0.2% of preemption requests when a 15 mu s preemption latency constraint is used. For multi-programmed workloads, Chimera can improve the average normalized turnaround time by 5.5x, and system throughput by 12.2%.
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Engineering > School of Electronic & Electrical Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetrics

Total Views & Downloads

BROWSE