Design and implementation of web scraping platform using spring framework based on distributed hadoop ecosystem
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Yoo J.-S. | - |
dc.contributor.author | Lee M.-H. | - |
dc.date.available | 2020-04-06T07:41:30Z | - |
dc.date.created | 2020-04-02 | - |
dc.date.issued | 2019-09 | - |
dc.identifier.issn | 2005-4238 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/26535 | - |
dc.description.abstract | Background/Objectives: Recently, a rapid increase of general data and informal data and the quick data generation in all fields have a great effect on how to utilize data.The demand is increasing for utilizing Big Data in making important decisions for organizations by interpreting various patterns in a number of heterogeneous data, and predicting the future. Methods/Statistical analysis: Furthermore, it is necessary to provide services quickly based on up-to-date information. However, in most research, the collection, loading and processing of data have been insufficient and great attention has been paid to the analysis of data. Findings: Thus, this research collects the data searched with keywords through the Spring Framework using next generation web standards and through Web scraping based on the Hadoop 2.0 Ecosystem, loads the collected data on to a Hadoop Distributed File System (HDFS) and HBase, and designs and implements a Big Data utilization system that can schematize, through a word cloud, the results of analysis of keyword, title, contents and morpheme on the basis of contents and nouns extracted from the loaded data with a Twitter morpheme analyzer. Improvements/Applications: This research intends to provide a platform reference model that is applicable to enterprise groupware to which the Distributed Hadoop Ecosystem and the Spring Framework under next generation web standards are applied. © 2019 SERSC. | - |
dc.language | 영어 | - |
dc.language.iso | en | - |
dc.publisher | Science and Engineering Research Support Society | - |
dc.relation.isPartOf | International Journal of Advanced Science and Technology | - |
dc.title | Design and implementation of web scraping platform using spring framework based on distributed hadoop ecosystem | - |
dc.type | Article | - |
dc.type.rims | ART | - |
dc.description.journalClass | 1 | - |
dc.identifier.bibliographicCitation | International Journal of Advanced Science and Technology, v.28, no.5, pp.174 - 182 | - |
dc.description.isOpenAccess | N | - |
dc.identifier.scopusid | 2-s2.0-85080111796 | - |
dc.citation.endPage | 182 | - |
dc.citation.startPage | 174 | - |
dc.citation.title | International Journal of Advanced Science and Technology | - |
dc.citation.volume | 28 | - |
dc.citation.number | 5 | - |
dc.contributor.affiliatedAuthor | Yoo J.-S. | - |
dc.type.docType | Article | - |
dc.subject.keywordAuthor | Big Data | - |
dc.subject.keywordAuthor | Distributed Hadoop Ecosystem | - |
dc.subject.keywordAuthor | HBase | - |
dc.subject.keywordAuthor | HDFS | - |
dc.subject.keywordAuthor | Spring Framework | - |
dc.subject.keywordAuthor | Web Scraping | - |
dc.description.journalRegisteredClass | scopus | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
1342, Seongnam-daero, Sujeong-gu, Seongnam-si, Gyeonggi-do, Republic of Korea(13120)031-750-5114
COPYRIGHT 2020 Gachon University All Rights Reserved.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.