Research on Big Data Integration Method
- Authors
- 김지현; 조영임
- Issue Date
- 2017
- Publisher
- 한국컴퓨터정보학회
- Keywords
- R; Big Dаtа; Hаdoop; ff; Streаming; Rhipe; RHаdoop
- Citation
- 한국컴퓨터정보학회논문지, v.22, no.1, pp.49 - 56
- Journal Title
- 한국컴퓨터정보학회논문지
- Volume
- 22
- Number
- 1
- Start Page
- 49
- End Page
- 56
- URI
- https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/7189
- ISSN
- 1598-849X
- Abstract
- In this paper we propose the approach for big data integration so as to analyze, visualize and predict the future of the trend of the market, and that is to get the integration data model using the R language which is the future of the statistics and the Hadoop which is a parallel processing for the data. As four approaching methods using R and Hadoop, ff package in R, R and Streaming as Hadoop utility, and Rhipe and RHadoop as R and Hadoop interface packages are used, and the strength and weakness of four methods are described and analyzed, so Rhipe and RHadoop are proposed as a complete set of data integration model. The integration of R, which is popular for processing statistical algorithm and Hadoop contains Distributed File System and resource management platform and can implement the MapReduce programming model gives us a new environment where in R code can be written and deployed in Hadoop without any data movement. This model allows us to predictive analysis with high performance and deep understand over the big data.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - IT융합대학 > 컴퓨터공학과 > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.