Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

A compiler-based approach for GPGPU performance calibration using TLP modulation (WIP Paper)

Full metadata record
DC Field Value Language
dc.contributor.authorYu, Yongseung-
dc.contributor.authorKang, Seokwon-
dc.contributor.authorPark, Yongjun-
dc.date.accessioned2022-07-09T14:02:31Z-
dc.date.available2022-07-09T14:02:31Z-
dc.date.created2021-05-13-
dc.date.issued2019-06-
dc.identifier.issn0000-0000-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/147636-
dc.description.abstractModern GPUs are the most successful accelerators as they provide outstanding performance gain by using CUDA or OpenCL programming models. For maximum performance, programmers typically try to maximize the number of thread blocks of target programs, and GPUs also generally attempt to allocate the maximum number of thread blocks to their GPU cores. However, many recent studies have pointed out that simply allocating the maximum number of thread blocks to GPU cores does not always guarantee the best performance, and identifying proper number of thread blocks per GPU core is a major challenge. Despite these studies, most existing architectural techniques cannot be directly applied to current GPU hardware, and the optimal number of thread blocks can vary significantly depending on the target GPU and application characteristics. To solve these problems, this study proposes a just-in-time thread block number adjustment system using CUDA binary modification upon an LLVM compiler framework, referred to as the CTA Limiter, in order to dynamically maximize GPU performance on real GPUs without reprogramming. The framework gradually reduces the number of concurrent thread blocks of target CUDA workloads using extra shared memory allocation, and compares the execution time with the previous version to automatically identify the optimal number of co-running thread blocks per GPU Core. The results showed meaningful performance improvements, averaging at 30%, 40%, and 44%, in GTX 960, GTX 1050, and GTX 1080 Ti, respectively.-
dc.language영어-
dc.language.isoen-
dc.publisherAssociation for Computing Machinery-
dc.titleA compiler-based approach for GPGPU performance calibration using TLP modulation (WIP Paper)-
dc.typeArticle-
dc.contributor.affiliatedAuthorPark, Yongjun-
dc.identifier.doi10.1145/3316482.3326343-
dc.identifier.scopusid2-s2.0-85070991746-
dc.identifier.bibliographicCitationProceedings of the ACM SIGPLAN Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES), pp.193 - 197-
dc.relation.isPartOfProceedings of the ACM SIGPLAN Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES)-
dc.citation.titleProceedings of the ACM SIGPLAN Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES)-
dc.citation.startPage193-
dc.citation.endPage197-
dc.type.rimsART-
dc.type.docTypeConference Paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordPlusCalibration-
dc.subject.keywordPlusEmbedded systems-
dc.subject.keywordPlusGraphics processing unit-
dc.subject.keywordPlusMemory architecture-
dc.subject.keywordPlusProgram compilers-
dc.subject.keywordPlusBinary modification-
dc.subject.keywordPlusCode instrumentation-
dc.subject.keywordPlusConcurrent threads-
dc.subject.keywordPlusLLVM-
dc.subject.keywordPlusNumber of threads-
dc.subject.keywordPlusPerformance calibrations-
dc.subject.keywordPlusPerformance Gain-
dc.subject.keywordPlusProgramming models-
dc.subject.keywordPlusComputer hardware-
dc.subject.keywordAuthorCode Instrumentation-
dc.subject.keywordAuthorGPU-
dc.subject.keywordAuthorLLVM-
dc.subject.keywordAuthorPerformance Calibration-
dc.identifier.urlhttps://dl.acm.org/doi/10.1145/3316482.3326343-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Park, Yong jun photo

Park, Yong jun
서울 공과대학 (서울 컴퓨터소프트웨어학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE