OpenMP-based parallel implementation of matrix-matrix multiplication on the Intel Knights Landing

Lim, R.; Lee, Y.; Kim, R.; Choi, J.

doi:10.1145/3176364.3176374

Detailed Information

Cited 0 time in webofscience

Cited 3 time in scopus

Metadata Downloads

OpenMP-based parallel implementation of matrix-matrix multiplication on the Intel Knights Landing

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lim, R.	-
dc.contributor.author	Lee, Y.	-
dc.contributor.author	Kim, R.	-
dc.contributor.author	Choi, J.	-
dc.date.available	2019-04-10T09:56:17Z	-
dc.date.created	2018-09-12	-
dc.date.issued	2018-01	-
dc.identifier.isbn	9781450363471	-
dc.identifier.uri	http://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/32428	-
dc.description.abstract	The second generation Intel Xeon Phi processor codenamed Knights Landing (KNL) have emerged with 2D tile mesh architecture. Implementing of the general matrix-matrix multiplication on a new architecture is an important practice. To date, there has not been a sufficient description on a parallel implementation of the general matrix-matrix multiplication. In this study, we describe the parallel implementation of the double-precision general matrix-matrix multiplication (DGEMM) with OpenMP on the KNL. The implementation is based on the blocked matrix-matrix multiplication. We propose a method for choosing the cache block sizes and discuss the parallelism within the implementation of DGEMM. We show that the performance of DGEMM varies by the thread affinity environment variables. We conducted the performance experiments with the Intel Xeon Phi 7210 and 7250. The performance experiments validate our method. © 2018 Association for Computing Machinery.	-
dc.language	영어	-
dc.language.iso	en	-
dc.publisher	Association for Computing Machinery	-
dc.relation.isPartOf	ACM International Conference Proceeding Series	-
dc.title	OpenMP-based parallel implementation of matrix-matrix multiplication on the Intel Knights Landing	-
dc.type	Conference	-
dc.identifier.doi	10.1145/3176364.3176374	-
dc.type.rims	CONF	-
dc.identifier.bibliographicCitation	2018 Workshop on High Performance Computing Asia, HPC Asia 2018, pp.63 - 66	-
dc.description.journalClass	2	-
dc.identifier.scopusid	2-s2.0-85043240112	-
dc.citation.conferencePlace	US	-
dc.citation.endPage	66	-
dc.citation.startPage	63	-
dc.citation.title	2018 Workshop on High Performance Computing Asia, HPC Asia 2018	-
dc.contributor.affiliatedAuthor	Choi, J.	-
dc.type.docType	Conference Paper	-

Files in This Item: There are no files associated with this item.

Appears in Collections: College of Information Technology > School of Computer Science and Engineering > 2. Conference Papers

Show simple item record

qrcode

Related Researcher

Researcher Choi, Jaeyoung photo

Choi, Jaeyoung: College of Information Technology (School of Computer Science and Engineering)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :8,381,501; Today View :6,865

RSS_1.0 RSS_2.0 ATOM_1.0

Soongsil University Library 369 Sangdo-Ro, Dongjak-Gu, Seoul, Korea (06978)02-820-0733

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE