Detailed Information

Cited 1 time in webofscience Cited 2 time in scopus
Metadata Downloads

Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Sparkopen access

Authors
Azhir, ElhamHosseinzadeh, MehdiKhan, FaheemMosavi, Amir
Issue Date
Oct-2022
Publisher
MDPI
Keywords
access plan recommendation; parallel processing; Apache Hadoop; Apache Spark; big data; artificial intelligence; soft computing; cloud computing; data science; MapReduce
Citation
MATHEMATICS, v.10, no.19
Journal Title
MATHEMATICS
Volume
10
Number
19
URI
https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/86022
DOI
10.3390/math10193517
ISSN
2227-7390
Abstract
Access plan recommendation is a query optimization approach that executes new queries using prior created query execution plans (QEPs). The query optimizer divides the query space into clusters in the mentioned method. However, traditional clustering algorithms take a significant amount of execution time for clustering such large datasets. The MapReduce distributed computing model provides efficient solutions for storing and processing vast quantities of data. Apache Spark and Apache Hadoop frameworks are used in the present investigation to cluster different sizes of query datasets in the MapReduce-based access plan recommendation method. The performance evaluation is performed based on execution time. The results of the experiments demonstrated the effectiveness of parallel query clustering in achieving high scalability. Furthermore, Apache Spark achieved better performance than Apache Hadoop, reaching an average speedup of 2x.
Files in This Item
There are no files associated with this item.
Appears in
Collections
IT융합대학 > 컴퓨터공학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Khan, Faheem photo

Khan, Faheem
College of IT Convergence (컴퓨터공학부(컴퓨터공학전공))
Read more

Altmetrics

Total Views & Downloads

BROWSE