High-performance IoT streaming data prediction system using Spark: a case study of air pollution
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Jin, Ho-Yong | - |
dc.contributor.author | Jung, Eun-Sung | - |
dc.contributor.author | Lee, Duckki | - |
dc.date.available | 2021-03-17T06:50:19Z | - |
dc.date.created | 2021-02-26 | - |
dc.date.issued | 2020-09 | - |
dc.identifier.issn | 0941-0643 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/hongik/handle/2020.sw.hongik/11575 | - |
dc.description.abstract | Internet-of-Things (IoT) devices are becoming prevalent, and some of them, such as sensors, generate continuous time-series data, i.e., streaming data. These IoT streaming data are one of Big Data sources, and they require careful consideration for efficient data processing and analysis. Deep learning is emerging as a solution to IoT streaming data analytics. However, there is a persistent problem in deep learning that it takes a long time to learn neural networks. In this paper, we propose a high-performance IoT streaming data prediction system to improve the learning speed and to predict in real time. We showed the efficacy of the system through a case study of air pollution. The experimental results show that the modified LSTM autoencoder model shows the best performance compared to a generic LSTM model. We noticed that achieving the best performance requires optimizing many parameters, including learning rate, epoch, memory cell size, input timestep size, and the number of features/predictors. In that regard, we show that the high-performance data learning/prediction frameworks (e.g., Spark, Dist-Keras, and Hadoop) are essential to rapidly fine-tune a model for training and testing before real deployment of the model as data accumulate. | - |
dc.publisher | SPRINGER LONDON LTD | - |
dc.title | High-performance IoT streaming data prediction system using Spark: a case study of air pollution | - |
dc.type | Article | - |
dc.contributor.affiliatedAuthor | Jung, Eun-Sung | - |
dc.identifier.doi | 10.1007/s00521-019-04678-9 | - |
dc.identifier.scopusid | 2-s2.0-85076923245 | - |
dc.identifier.wosid | 000560557000008 | - |
dc.identifier.bibliographicCitation | NEURAL COMPUTING & APPLICATIONS, v.32, no.17, pp.13147 - 13154 | - |
dc.relation.isPartOf | NEURAL COMPUTING & APPLICATIONS | - |
dc.citation.title | NEURAL COMPUTING & APPLICATIONS | - |
dc.citation.volume | 32 | - |
dc.citation.number | 17 | - |
dc.citation.startPage | 13147 | - |
dc.citation.endPage | 13154 | - |
dc.type.rims | ART | - |
dc.type.docType | Article | - |
dc.description.journalClass | 1 | - |
dc.description.journalRegisteredClass | scie | - |
dc.description.journalRegisteredClass | scopus | - |
dc.relation.journalResearchArea | Computer Science | - |
dc.relation.journalWebOfScienceCategory | Computer Science, Artificial Intelligence | - |
dc.subject.keywordAuthor | Long Short-Term Memory (LSTM) | - |
dc.subject.keywordAuthor | Distributed deep learning | - |
dc.subject.keywordAuthor | Distributed Keras (Dist-Keras) | - |
dc.subject.keywordAuthor | Apache Spark | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
94, Wausan-ro, Mapo-gu, Seoul, 04066, Korea02-320-1314
COPYRIGHT 2020 HONGIK UNIVERSITY. ALL RIGHTS RESERVED.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.