Detailed Information

Cited 2 time in webofscience Cited 3 time in scopus
Metadata Downloads

Parameter based tuning model for optimizing performance on GPU

Full metadata record
DC Field Value Language
dc.contributor.authorNhat-Phuong Tran-
dc.contributor.authorLee, Myungho-
dc.contributor.authorChoi, Jaeyoung-
dc.date.available2018-05-08T14:30:22Z-
dc.date.created2018-04-17-
dc.date.issued2017-09-
dc.identifier.issn1386-7857-
dc.identifier.urihttp://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/6256-
dc.description.abstractRecently, the graphic processing units (GPUs) are becoming increasingly popular for the high performance computing applications. Although the GPUs provide high peak performance, exploiting the full performance potential for application programs, however, leaves a challenging task to the programmers. When launching a parallel kernel of an application on the GPU, the programmer needs to carefully select the number of blocks (grid size) and the number of threads per block (block size). These values determine the degree of SIMD parallelism and the multithreading, and greatly influence the performance. With a huge range of possible combinations of these values, choosing the right grid size and the block size is not straightforward. In this paper, we propose a mathematical model for tuning the grid size and the block size based on the GPU architecture parameters. Using our model we first calculate a small set of candidate grid size and block size values, then search for the optimal values out of the candidate values through experiments. Our approach significantly reduces the potential search space instead of exhaustive search approaches in the previous research. Thus our approach can be practically applied to the real applications.-
dc.publisherSPRINGER-
dc.relation.isPartOfCLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS-
dc.titleParameter based tuning model for optimizing performance on GPU-
dc.typeArticle-
dc.identifier.doi10.1007/s10586-017-1003-4-
dc.type.rimsART-
dc.identifier.bibliographicCitationCLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, v.20, no.3, pp.2133 - 2142-
dc.description.journalClass1-
dc.identifier.wosid000407928800019-
dc.identifier.scopusid2-s2.0-85021759226-
dc.citation.endPage2142-
dc.citation.number3-
dc.citation.startPage2133-
dc.citation.titleCLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS-
dc.citation.volume20-
dc.contributor.affiliatedAuthorChoi, Jaeyoung-
dc.type.docTypeArticle-
dc.subject.keywordAuthorGPU-
dc.subject.keywordAuthorHigh performance computing-
dc.subject.keywordAuthorPerformance tuning-
dc.subject.keywordAuthorMulti-threading-
dc.subject.keywordAuthorMicro-benchmark-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
Files in This Item
There are no files associated with this item.
Appears in
Collections
College of Information Technology > School of Computer Science and Engineering > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Choi, Jaeyoung photo

Choi, Jaeyoung
College of Information Technology (School of Computer Science and Engineering)
Read more

Altmetrics

Total Views & Downloads

BROWSE