머신러닝을 이용한 미세먼지 예측 연구

김삼용

doi:10.7465/jkdi.2023.34.5.763

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

머신러닝을 이용한 미세먼지 예측 연구A study on PM10 forecasting model using machine learning

Authors: 김삼용

Issue Date: Sep-2023

Publisher: 한국데이터정보과학회

Keywords: 머신러닝; 미세먼지 예측; CNN; GRU; LSTM; machine learning; PM10; forecasitng

Citation: 한국데이터정보과학회지, v.34, no.5, pp 763 - 773

Pages: 11

Journal Title: 한국데이터정보과학회지

Volume: 34

Number: 5

Start Page: 763

End Page: 773

URI: https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/68375

DOI: 10.7465/jkdi.2023.34.5.763

ISSN: 1598-9402

Abstract: 미세먼지란, 대기 중에 떠다니거나 흩날려 내려오는 입자상물질인 먼지 중 입자의 지름이 10㎍ 이하인 먼지를 말하며, 으로 표기하기도 한다. 이러한, 미세먼지는 매우 작은 크기로, 코나 기관지에서 걸러지지 않고 몸속에 스며들어 천식과 폐질환 또는 면역세포의 작용을 통해 염증을 일으키기도 한다. 최근 한국이 세계적으로 미세먼지 농도가 가장 높은 국가인 것이 밝혀졌는데, 미세먼지는 건강뿐 아니라 생태계 및 농작물에도 직접적인 영향을 미치기 때문에 정확한 예보 시스템을 통한 대책 마련을 강구하는 것이 중요하다. 따라서, 본 논문에서는 기상청이 제공하는 기상 데이터와 에어코리아에서 제공하는 대기오염물질 데이터를 이용하여 미세먼지 농도의 머신러닝 예측 성능 비교를 하고자 하였다. 지역으로는 황사의 유입 경로인 산둥반도와 가장 인접한 인천광역시의 데이터를 추출하였고, 인천시의 다양한 기상 요인 및 대기오염물질들의 상관관계 확인 후, 모형을 구축하였다. 모형으로는 MLP, RNN, LSTM, GRU 그리고 CNN을 사용하였고, 기본적인 하이퍼파라미터와 단일층으로 구성하여 예측 성능을 비교하였다. 그 후, GRU1 (단일층) 모형에 층을 추가한 GRU2 모형을 새롭게 구성하여 가장 예측 성능이 좋았던 GRU1 모형과 비교해보았다. 예측 성능은 테스트 데이터에서 MAE와 RMSE로 평가하였다. 대부분 비슷한 예측 성능을 보였지만, GRU1 모형이 MAE 8.80, RMSE 14.61로 다른 모형들에 비해 가장 성능이 우수하다는 것을 확인할 수 있었다. 가장 예측 성능이 낮은 모형은 MLP 모형이며, 그 뒤로는 RNN, LSTM, GRU2, CNN 순으로 예측 성능이 우수하였다.
Fine dust refers to dust with a particle diameter of less than 10μg among dust, which is a particulate matter floating or flying down in the atmosphere, and is also referred to as PM10. These fine dust is very small in size and permeates the body without being filtered from the nose or bronchial tubes, causing inflammation through asthma, lung disease, or the action of immune cells. Recently, it was found that Korea has the highest concentration of fine dust in the world, and it is important to take measures through an accurate forecast system because fine dust directly affects not only health but also ecosystems and crops. Therefore, this paper attempted to compare machine learning prediction performance of fine dust concentration using weather data provided by the Korea Meteorological Administration and air pollutant data provided by Air Korea. As for the region, data from Incheon Metropolitan City, which is the closest to the Shandong Peninsula, the inflow path of yellow dust, were extracted, and a model was built after confirming the correlation between various weather factors and air pollutants in Incheon. MLP, RNN, LSTM, GRU, and CNN were used as models, and predictive performance was compared by organizing basic hyperparameters and single layers. After that, the GRU2 model, which added layers to the GRU1 (single layer) model, was newly constructed and compared with the GRU1 model with the best prediction performance. Prediction performance was evaluated by MAE and RMSE in test data. Most of them showed similar predictive performance, but it was confirmed that the GRU1 model had the best performance compared to other models, with MAE 8.80 and RMSE 14.61. The model with the lowest prediction performance was the MLP model, followed by RNN, LSTM, GRU2, and CNN.

Files in This Item: There are no files associated with this item.

Appears in Collections: ETC > 1. Journal Articles

Show full item record

qrcode

Related Researcher

Researcher Kim, Sahm Yong photo

Kim, Sahm Yong: 대학원 (통계데이터사이언스학과)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :7,495,707; Today View :6,385

RSS_1.0 RSS_2.0 ATOM_1.0

84, Heukseok-ro, Dongjak-gu, Seoul, Republic of Korea (06974)02-820-6194

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE