OpenMP-based parallel implementation of matrix-matrix multiplication on the Intel Knights Landing
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lim, R. | - |
dc.contributor.author | Lee, Y. | - |
dc.contributor.author | Kim, R. | - |
dc.contributor.author | Choi, J. | - |
dc.date.available | 2019-04-10T09:56:17Z | - |
dc.date.created | 2018-09-12 | - |
dc.date.issued | 2018-01 | - |
dc.identifier.isbn | 9781450363471 | - |
dc.identifier.uri | http://scholarworks.bwise.kr/ssu/handle/2018.sw.ssu/32428 | - |
dc.description.abstract | The second generation Intel Xeon Phi processor codenamed Knights Landing (KNL) have emerged with 2D tile mesh architecture. Implementing of the general matrix-matrix multiplication on a new architecture is an important practice. To date, there has not been a sufficient description on a parallel implementation of the general matrix-matrix multiplication. In this study, we describe the parallel implementation of the double-precision general matrix-matrix multiplication (DGEMM) with OpenMP on the KNL. The implementation is based on the blocked matrix-matrix multiplication. We propose a method for choosing the cache block sizes and discuss the parallelism within the implementation of DGEMM. We show that the performance of DGEMM varies by the thread affinity environment variables. We conducted the performance experiments with the Intel Xeon Phi 7210 and 7250. The performance experiments validate our method. © 2018 Association for Computing Machinery. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | Association for Computing Machinery | - |
dc.relation.isPartOf | ACM International Conference Proceeding Series | - |
dc.title | OpenMP-based parallel implementation of matrix-matrix multiplication on the Intel Knights Landing | - |
dc.type | Conference | - |
dc.identifier.doi | 10.1145/3176364.3176374 | - |
dc.type.rims | CONF | - |
dc.identifier.bibliographicCitation | 2018 Workshop on High Performance Computing Asia, HPC Asia 2018, pp.63 - 66 | - |
dc.description.journalClass | 2 | - |
dc.identifier.scopusid | 2-s2.0-85043240112 | - |
dc.citation.conferencePlace | US | - |
dc.citation.endPage | 66 | - |
dc.citation.startPage | 63 | - |
dc.citation.title | 2018 Workshop on High Performance Computing Asia, HPC Asia 2018 | - |
dc.contributor.affiliatedAuthor | Choi, J. | - |
dc.type.docType | Conference Paper | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
Soongsil University Library 369 Sangdo-Ro, Dongjak-Gu, Seoul, Korea (06978)02-820-0733
COPYRIGHT ⓒ SOONGSIL UNIVERSITY, ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.