Analysis of the Yearbook from the Korea Meteorological Administration using a text-mining agorithm
- Authors
- Sun, Hyunseok; Lim, Changwon; Lee, YungSeop
- Issue Date
- Aug-2017
- Publisher
- KOREAN STATISTICAL SOC
- Keywords
- text-mining; unstructured format; the Korea Meteorological Administration; word cloud
- Citation
- KOREAN JOURNAL OF APPLIED STATISTICS, v.30, no.4, pp 603 - 613
- Pages
- 11
- Journal Title
- KOREAN JOURNAL OF APPLIED STATISTICS
- Volume
- 30
- Number
- 4
- Start Page
- 603
- End Page
- 613
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/4197
- DOI
- 10.5351/KJAS.2017.30.4.603
- ISSN
- 1225-066X
2383-5818
- Abstract
- Many people have recently posted about personal interests on social media. The development of the Internet and computer technology has enabled the storage of digital forms of documents that has resulted in an explosion of the amount of textual data generated; subsequently there is an increased demand for technology to create valuable information from a large number of documents. A text mining technique is often used since text-based data is mostly composed of unstructured forms that are not suitable for the application of statistical analysis or data mining techniques. This study analyzed the Meteorological Yearbook data of the Korea Meteorological Administration (KMA) with a text mining technique. First, a term dictionary was constructed through preprocessing and a term-document matrix was generated. This term dictionary was then used to calculate the annual frequency of term, and observe the change in relative frequency for frequently appearing words. We also used regression analysis to identify terms with increasing and decreasing trends. We analyzed the trends in the Meteorological Yearbook of the KMA and analyzed trends of weather related news, weather status, and status of work trends that the KMA focused on. This study is to provide useful information that can help analyze and improve the meteorological services and reflect meteorological policy.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Business & Economics > Department of Applied Statistics > 1. Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/4197)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.