Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

PreScaler: An efficient system-aware precision scaling framework on heterogeneous systems

Full metadata record
DC Field Value Language
dc.contributor.authorKang, Seokwon-
dc.contributor.authorChoi, Kyunghwan-
dc.contributor.authorPark, Yongjun-
dc.date.accessioned2022-07-08T14:04:40Z-
dc.date.available2022-07-08T14:04:40Z-
dc.date.created2021-05-13-
dc.date.issued2020-02-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/146195-
dc.description.abstractGraphics processing units (GPUs) have been commonly utilized to accelerate multiple emerging applications, such as big data processing and machine learning. While GPUs are proven to be effective, approximate computing, to trade off performance with accuracy, is one of the most common solutions for further performance improvement. Precision scaling of originally high-precision values into lower-precision values has recently been the most widely used GPU-side approximation technique, including hardware-level half-precision support. Although several approaches to find optimal mixed-precision configuration of GPU-side kernels have been introduced, total program performance gain is often low because total execution time is the combination of data transfer, type conversion, and kernel execution. As a result, kernel-level scaling may incur high type-conversion overhead of the kernel input/output data. To address this problem, this paper proposes an automatic precision scaling framework called PreScaler that maximizes the program performance at the memory object level by considering whole OpenCL program flows. The main difficulty is that the best configuration cannot be easily predicted due to various application- and system-specific characteristics. PreScaler solves this problem using search space minimization and decision-tree-based search processes. First, it minimizes the number of test configurations based on the information from system inspection and dynamic profiling. Then, it finds the best memory-object level mixed-precision configuration using a decision-tree-based search. PreScaler achieves an average performance gain of 1.33x over the baseline while maintaining the target output quality level.-
dc.language영어-
dc.language.isoen-
dc.publisherAssociation for Computing Machinery, Inc-
dc.titlePreScaler: An efficient system-aware precision scaling framework on heterogeneous systems-
dc.typeArticle-
dc.contributor.affiliatedAuthorPark, Yongjun-
dc.identifier.doi10.1145/3368826.3377917-
dc.identifier.scopusid2-s2.0-85082111764-
dc.identifier.bibliographicCitationCGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization, pp.280 - 292-
dc.relation.isPartOfCGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization-
dc.citation.titleCGO 2020 - Proceedings of the 18th ACM/IEEE International Symposium on Code Generation and Optimization-
dc.citation.startPage280-
dc.citation.endPage292-
dc.type.rimsART-
dc.type.docTypeConference Paper-
dc.description.journalClass1-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordPlusComputer graphics-
dc.subject.keywordPlusComputer hardware-
dc.subject.keywordPlusData transfer-
dc.subject.keywordPlusDecision trees-
dc.subject.keywordPlusEconomic and social effects-
dc.subject.keywordPlusFrequency dividing circuits-
dc.subject.keywordPlusGraphics processing unit-
dc.subject.keywordPlusProgram processors-
dc.subject.keywordPlusApproximation techniques-
dc.subject.keywordPlusCompiler-
dc.subject.keywordPlusEmerging applications-
dc.subject.keywordPlusHeterogeneous systems-
dc.subject.keywordPlusPrecision Scaling-
dc.subject.keywordPlusProfile-guided-
dc.subject.keywordPlusProgram performance-
dc.subject.keywordPlusRuntimes-
dc.subject.keywordPlusData handling-
dc.subject.keywordAuthorCompiler-
dc.subject.keywordAuthorHSA-
dc.subject.keywordAuthorPrecision Scaling-
dc.subject.keywordAuthorProfile-guided-
dc.subject.keywordAuthorRuntime-
dc.identifier.urlhttps://dl.acm.org/doi/10.1145/3368826.3377917-
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Park, Yong jun photo

Park, Yong jun
서울 공과대학 (서울 컴퓨터소프트웨어학부)
Read more

Altmetrics

Total Views & Downloads

BROWSE