A technique for parallel query optimization using MapReduce framework and a semantic-based clustering method
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Azhir, Elham | - |
dc.contributor.author | Navimipour, Nima Jafari | - |
dc.contributor.author | Hosseinzadeh, Mehdi | - |
dc.contributor.author | Sharifi, Arash | - |
dc.contributor.author | Darwesh, Aso | - |
dc.date.accessioned | 2021-07-04T07:40:59Z | - |
dc.date.available | 2021-07-04T07:40:59Z | - |
dc.date.created | 2021-06-18 | - |
dc.date.issued | 2021-06-01 | - |
dc.identifier.issn | 2376-5992 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/81553 | - |
dc.description.abstract | Query optimization is the process of identifying the best Query Execution Plan (QEP). The query optimizer produces a close to optimal QEP for the given queries based on the minimum resource usage. The problem is that for a given query, there are plenty of different equivalent execution plans, each with a corresponding execution cost. To produce an effective query plan thus requires examining a large number of alternative plans. Access plan recommendation is an alternative technique to database query optimization, which reuses the previously-generated QEPs to execute new queries. In this technique, the query optimizer uses clustering methods to identify groups of similar queries. However, clustering such large datasets is challenging for traditional clustering algorithms due to huge processing time. Numerous cloud-based platforms have been introduced that offer low-cost solutions for the processing of distributed queries such as Hadoop, Hive, Pig, etc. This paper has applied and tested a model for clustering variant sizes of large query datasets parallelly using MapReduce. The results demonstrate the effectiveness of the parallel implementation of query workloads clustering to achieve good scalability. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | PEERJ INC | - |
dc.relation.isPartOf | PEERJ COMPUTER SCIENCE | - |
dc.title | A technique for parallel query optimization using MapReduce framework and a semantic-based clustering method | - |
dc.type | Article | - |
dc.type.rims | ART | - |
dc.description.journalClass | 1 | - |
dc.identifier.wosid | 000658894100001 | - |
dc.identifier.doi | 10.7717/peerj-cs.580 | - |
dc.identifier.bibliographicCitation | PEERJ COMPUTER SCIENCE, v.7 | - |
dc.description.isOpenAccess | N | - |
dc.identifier.scopusid | 2-s2.0-85108551777 | - |
dc.citation.title | PEERJ COMPUTER SCIENCE | - |
dc.citation.volume | 7 | - |
dc.contributor.affiliatedAuthor | Hosseinzadeh, Mehdi | - |
dc.type.docType | Article | - |
dc.subject.keywordAuthor | Query optimization | - |
dc.subject.keywordAuthor | Access plan recommendation | - |
dc.subject.keywordAuthor | Cluster computing | - |
dc.subject.keywordAuthor | Parallel Processing | - |
dc.subject.keywordAuthor | MapReduce | - |
dc.subject.keywordAuthor | DBSCAN Algorithm | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Theory & Methods | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
1342, Seongnam-daero, Sujeong-gu, Seongnam-si, Gyeonggi-do, Republic of Korea(13120)031-750-5114
COPYRIGHT 2020 Gachon University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.