Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Azhir, Elham | - |
dc.contributor.author | Hosseinzadeh, Mehdi | - |
dc.contributor.author | Khan, Faheem | - |
dc.contributor.author | Mosavi, Amir | - |
dc.date.accessioned | 2022-11-11T07:40:23Z | - |
dc.date.available | 2022-11-11T07:40:23Z | - |
dc.date.created | 2022-11-08 | - |
dc.date.issued | 2022-10 | - |
dc.identifier.issn | 2227-7390 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/86022 | - |
dc.description.abstract | Access plan recommendation is a query optimization approach that executes new queries using prior created query execution plans (QEPs). The query optimizer divides the query space into clusters in the mentioned method. However, traditional clustering algorithms take a significant amount of execution time for clustering such large datasets. The MapReduce distributed computing model provides efficient solutions for storing and processing vast quantities of data. Apache Spark and Apache Hadoop frameworks are used in the present investigation to cluster different sizes of query datasets in the MapReduce-based access plan recommendation method. The performance evaluation is performed based on execution time. The results of the experiments demonstrated the effectiveness of parallel query clustering in achieving high scalability. Furthermore, Apache Spark achieved better performance than Apache Hadoop, reaching an average speedup of 2x. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | MDPI | - |
dc.relation.isPartOf | MATHEMATICS | - |
dc.title | Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark | - |
dc.type | Article | - |
dc.type.rims | ART | - |
dc.description.journalClass | 1 | - |
dc.identifier.wosid | 000867006100001 | - |
dc.identifier.doi | 10.3390/math10193517 | - |
dc.identifier.bibliographicCitation | MATHEMATICS, v.10, no.19 | - |
dc.description.isOpenAccess | Y | - |
dc.identifier.scopusid | 2-s2.0-85139762242 | - |
dc.citation.title | MATHEMATICS | - |
dc.citation.volume | 10 | - |
dc.citation.number | 19 | - |
dc.contributor.affiliatedAuthor | Khan, Faheem | - |
dc.type.docType | Article | - |
dc.subject.keywordAuthor | access plan recommendation | - |
dc.subject.keywordAuthor | parallel processing | - |
dc.subject.keywordAuthor | Apache Hadoop | - |
dc.subject.keywordAuthor | Apache Spark | - |
dc.subject.keywordAuthor | big data | - |
dc.subject.keywordAuthor | artificial intelligence | - |
dc.subject.keywordAuthor | soft computing | - |
dc.subject.keywordAuthor | cloud computing | - |
dc.subject.keywordAuthor | data science | - |
dc.subject.keywordAuthor | MapReduce | - |
dc.subject.keywordPlus | ALGORITHM | - |
dc.relation.journalResearchArea | Mathematics | - |
dc.relation.journalWebOfScienceCategory | Mathematics | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
1342, Seongnam-daero, Sujeong-gu, Seongnam-si, Gyeonggi-do, Republic of Korea(13120)031-750-5114
COPYRIGHT 2020 Gachon University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.