Asymptotically Optimal Merging on ManyCore GPUs

Kutzner, Arne; Kim, Pok-Son; Park, Won-Kwang

doi:10.1587/transinf.E95.D.2769

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Asymptotically Optimal Merging on ManyCore GPUs

Full metadata record

DC Field	Value	Language
dc.contributor.author	Kutzner, Arne	-
dc.contributor.author	Kim, Pok-Son	-
dc.contributor.author	Park, Won-Kwang	-
dc.date.accessioned	2022-07-16T12:33:53Z	-
dc.date.available	2022-07-16T12:33:53Z	-
dc.date.issued	2012-12	-
dc.identifier.issn	0916-8532	-
dc.identifier.issn	1745-1361	-
dc.identifier.uri	https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/164074	-
dc.description.abstract	We propose a family of algorithms for efficiently merging on contemporary GPUs, so that each algorithm requires O(m log(n/m + 1)) element comparisons, where m and n are the sizes of the input sequences with m <= n. According to the lower bounds for merging all proposed algorithms are asymptotically optimal regarding the number of necessary comparisons. First we introduce a parallely structured algorithm that splits a merging problem of size 2(l) into 2(i) subproblems of size 2(l-i), for some arbitrary i with (0 <= i <= l). This algorithm represents a merger for i = 1 but it is rather inefficient in this case. The efficiency is boosted by moving to a two stage approach where the splitting process stops at some predetermined level and transfers control to several parallely operating block-mergers. We formally prove the asymptotic optimality of the splitting process and show that for symmetrically sized inputs our approach delivers up to 4 times faster runtimes than the thrust: :merge function that is part of the Thrust library. For assessing the value of our merging technique in the context of sorting we construct and evaluate a MergeSort on top of it. In the context of our benchmarking the resulting MergeSort clearly outperforms the MergeSort implementation provided by the Thrust library as well as Cederman's GPU optimized variant of QuickSort.	-
dc.format.extent	9	-
dc.language	영어	-
dc.language.iso	ENG	-
dc.publisher	Oxford University Press	-
dc.title	Asymptotically Optimal Merging on ManyCore GPUs	-
dc.type	Article	-
dc.publisher.location	일본	-
dc.identifier.doi	10.1587/transinf.E95.D.2769	-
dc.identifier.scopusid	2-s2.0-84870673096	-
dc.identifier.wosid	000313146300004	-
dc.identifier.bibliographicCitation	IEICE Transactions on Information and Systems, v.E95D, no.12, pp 2769 - 2777	-
dc.citation.title	IEICE Transactions on Information and Systems	-
dc.citation.volume	E95D	-
dc.citation.number	12	-
dc.citation.startPage	2769	-
dc.citation.endPage	2777	-
dc.type.docType	Article	-
dc.description.isOpenAccess	N	-
dc.description.journalRegisteredClass	scopus	-
dc.relation.journalResearchArea	Computer Science	-
dc.relation.journalWebOfScienceCategory	Computer Science, Information Systems	-
dc.relation.journalWebOfScienceCategory	Computer Science, Software Engineering	-
dc.subject.keywordPlus	PARALLEL	-
dc.subject.keywordPlus	GRAPHICS	-
dc.subject.keywordAuthor	parallel algorithms	-
dc.subject.keywordAuthor	GPGPU	-
dc.subject.keywordAuthor	complexity	-
dc.subject.keywordAuthor	merging	-
dc.subject.keywordAuthor	sorting	-
dc.identifier.url	https://www.jstage.jst.go.jp/article/transinf/E95.D/12/E95.D_2769/_article	-

Files in This Item: Go to Link

Appears in Collections: 서울 공과대학 > 서울 정보시스템학과 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Kutzner, Arne photo

Kutzner, Arne: COLLEGE OF ENGINEERING (DEPARTMENT OF INFORMATION SYSTEMS)

Read more

Altmetrics

Total Views & Downloads

RSS_1.0 RSS_2.0 ATOM_1.0

222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE