Parallel computation of k-nearest neighbor joins using MapReduce
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Kim, Wooyeol | - |
dc.contributor.author | Kim, Younghoon | - |
dc.contributor.author | Shim, Kyuseok | - |
dc.date.accessioned | 2021-06-22T18:04:12Z | - |
dc.date.available | 2021-06-22T18:04:12Z | - |
dc.date.issued | 2016-12 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/erica/handle/2021.sw.erica/15597 | - |
dc.description.abstract | The k-nearest neighbor (kNN) join has recently attracted considerable attention due to its broad applications. However, processing fcNN joins is very expensive due to the quadratic nature of the join operation. Furthermore, since there is an increasing trend of applications to deal with big data, computing fcNN joins becomes more challenging. In order to process such big data, parallel and distributed computing using MapReduce recently have received a lot of attention. In this paper, we propose the efficient parallel algorithm KNN-MR to process the fcNN joins using MapReduce. To reduce not only the computational cost of fcNN joins but also the network cost of communicating across machines, we develop the novel vector projection pruning which enables us to identify non-fcNN points that are guaranteed not to be included in the result of a fcNN join. Our performance study confirms the effectiveness and scalability of the proposed algorithm. © 2016 IEEE. | - |
dc.format.extent | 10 | - |
dc.language | 영어 | - |
dc.language.iso | ENG | - |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | - |
dc.title | Parallel computation of k-nearest neighbor joins using MapReduce | - |
dc.type | Article | - |
dc.publisher.location | 미국 | - |
dc.identifier.doi | 10.1109/BigData.2016.7840662 | - |
dc.identifier.scopusid | 2-s2.0-85015175716 | - |
dc.identifier.wosid | 000399115000083 | - |
dc.identifier.bibliographicCitation | Proceedings - 2016 IEEE International Conference on Big Data, Big Data 2016, pp 696 - 705 | - |
dc.citation.title | Proceedings - 2016 IEEE International Conference on Big Data, Big Data 2016 | - |
dc.citation.startPage | 696 | - |
dc.citation.endPage | 705 | - |
dc.type.docType | Conference Paper | - |
dc.description.isOpenAccess | N | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
dc.subject.keywordPlus | Distributed computer systems | - |
dc.subject.keywordPlus | Learning algorithms | - |
dc.subject.keywordPlus | Motion compensation | - |
dc.subject.keywordPlus | Nearest neighbor search | - |
dc.subject.keywordPlus | Computational costs | - |
dc.subject.keywordPlus | Hadoop | - |
dc.subject.keywordPlus | K nearest neighbor (KNN) | - |
dc.subject.keywordPlus | K-nearest neighbors | - |
dc.subject.keywordPlus | Map-reduce | - |
dc.subject.keywordPlus | Parallel and distributed computing | - |
dc.subject.keywordPlus | Parallel Computation | - |
dc.subject.keywordPlus | Performance study | - |
dc.subject.keywordPlus | Big data | - |
dc.subject.keywordAuthor | Hadoop | - |
dc.subject.keywordAuthor | kNN joins | - |
dc.subject.keywordAuthor | MapReduce | - |
dc.identifier.url | https://ieeexplore.ieee.org/document/7840662 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
55 Hanyangdeahak-ro, Sangnok-gu, Ansan, Gyeonggi-do, 15588, Korea+82-31-400-4269 sweetbrain@hanyang.ac.kr
COPYRIGHT © 2021 HANYANG UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.