An efficient key partitioning scheme for heterogeneous MapReduce clusters
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Hanif, Muhammad | - |
dc.contributor.author | Lee, Choon hwa | - |
dc.date.accessioned | 2022-07-15T18:09:17Z | - |
dc.date.available | 2022-07-15T18:09:17Z | - |
dc.date.created | 2021-05-13 | - |
dc.date.issued | 2016-03 | - |
dc.identifier.issn | 1738-9445 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/154985 | - |
dc.description.abstract | Hadoop is a standard implementation of MapReduce framework for running data-intensive applications on the clusters of commodity servers. By thoroughly studying the framework we find out that the shuffle phase, all-to-all input data fetching phase in reduce task significantly affect the application performance. There is a problem of variance in both the intermediate key's frequencies and their distribution among data nodes throughout the cluster in Hadoop's MapReduce system. This variance in system causes network overhead which leads to unfairness on the reduce input among different data nodes in the cluster. Because of the above problem, applications experience performance degradation due to shuffle phase of MapReduce applications. We develop a new novel algorithm; unlike previous systems our algorithm considers a node's capabilities as heuristics to decide a better available trade-off for the locality and fairness in the system. By comparing with the default Hadoop's partitioning algorithm and Leen algorithm, on the average our approach achieve performance gain of 29% and 17%, respectively. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | An efficient key partitioning scheme for heterogeneous MapReduce clusters | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Lee, Choon hwa | - |
dc.identifier.doi | 10.1109/ICACT.2016.7423394 | - |
dc.identifier.scopusid | 2-s2.0-84962815473 | - |
dc.identifier.bibliographicCitation | International Conference on Advanced Communication Technology, ICACT, v.2016-March, pp.364 - 367 | - |
dc.relation.isPartOf | International Conference on Advanced Communication Technology, ICACT | - |
dc.citation.title | International Conference on Advanced Communication Technology, ICACT | - |
dc.citation.volume | 2016-March | - |
dc.citation.startPage | 364 | - |
dc.citation.endPage | 367 | - |
dc.type.rims | ART | - |
dc.type.docType | Conference Paper | - |
dc.description.journalClass | 1 | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scopus | - |
dc.subject.keywordPlus | Algorithms | - |
dc.subject.keywordPlus | Cloud computing | - |
dc.subject.keywordPlus | Economic and social effects | - |
dc.subject.keywordPlus | Electronic trading | - |
dc.subject.keywordPlus | Application performance | - |
dc.subject.keywordPlus | Context- awareness | - |
dc.subject.keywordPlus | Data-intensive application | - |
dc.subject.keywordPlus | Hadoop | - |
dc.subject.keywordPlus | Heterogeneous systems | - |
dc.subject.keywordPlus | Map-reduce | - |
dc.subject.keywordPlus | Partitioning algorithms | - |
dc.subject.keywordPlus | Performance degradation | - |
dc.subject.keywordPlus | Distributed computer systems | - |
dc.subject.keywordAuthor | Cloud Computing | - |
dc.subject.keywordAuthor | Context-awareness | - |
dc.subject.keywordAuthor | Hadoop | - |
dc.subject.keywordAuthor | Heterogeneous system | - |
dc.subject.keywordAuthor | MapReduce | - |
dc.identifier.url | https://ieeexplore.ieee.org/document/7423394 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1365
COPYRIGHT © 2021 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.