Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Design and implementation of web scraping platform using spring framework based on distributed hadoop ecosystem

Authors
Yoo J.-S.Lee M.-H.
Issue Date
Sep-2019
Publisher
Science and Engineering Research Support Society
Keywords
Big Data; Distributed Hadoop Ecosystem; HBase; HDFS; Spring Framework; Web Scraping
Citation
International Journal of Advanced Science and Technology, v.28, no.5, pp.174 - 182
Journal Title
International Journal of Advanced Science and Technology
Volume
28
Number
5
Start Page
174
End Page
182
URI
https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/26535
ISSN
2005-4238
Abstract
Background/Objectives: Recently, a rapid increase of general data and informal data and the quick data generation in all fields have a great effect on how to utilize data.The demand is increasing for utilizing Big Data in making important decisions for organizations by interpreting various patterns in a number of heterogeneous data, and predicting the future. Methods/Statistical analysis: Furthermore, it is necessary to provide services quickly based on up-to-date information. However, in most research, the collection, loading and processing of data have been insufficient and great attention has been paid to the analysis of data. Findings: Thus, this research collects the data searched with keywords through the Spring Framework using next generation web standards and through Web scraping based on the Hadoop 2.0 Ecosystem, loads the collected data on to a Hadoop Distributed File System (HDFS) and HBase, and designs and implements a Big Data utilization system that can schematize, through a word cloud, the results of analysis of keyword, title, contents and morpheme on the basis of contents and nouns extracted from the loaded data with a Twitter morpheme analyzer. Improvements/Applications: This research intends to provide a platform reference model that is applicable to enterprise groupware to which the Distributed Hadoop Ecosystem and the Spring Framework under next generation web standards are applied. © 2019 SERSC.
Files in This Item
There are no files associated with this item.
Appears in
Collections
공과대학 > 산업경영공학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Yoo, Jung Sang photo

Yoo, Jung Sang
Engineering (Department of Industrial Engineering)
Read more

Altmetrics

Total Views & Downloads

BROWSE