Design and implementation of web scraping platform using spring framework based on distributed hadoop ecosystem

Yoo J.-S.; Lee M.-H.

Detailed Information

Cited 0 time in webofscience

Cited 0 time in scopus

Metadata Downloads

Design and implementation of web scraping platform using spring framework based on distributed hadoop ecosystem

Full metadata record

DC Field	Value	Language
dc.contributor.author	Yoo J.-S.	-
dc.contributor.author	Lee M.-H.	-
dc.date.available	2020-04-06T07:41:30Z	-
dc.date.created	2020-04-02	-
dc.date.issued	2019-09	-
dc.identifier.issn	2005-4238	-
dc.identifier.uri	https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/26535	-
dc.description.abstract	Background/Objectives: Recently, a rapid increase of general data and informal data and the quick data generation in all fields have a great effect on how to utilize data.The demand is increasing for utilizing Big Data in making important decisions for organizations by interpreting various patterns in a number of heterogeneous data, and predicting the future. Methods/Statistical analysis: Furthermore, it is necessary to provide services quickly based on up-to-date information. However, in most research, the collection, loading and processing of data have been insufficient and great attention has been paid to the analysis of data. Findings: Thus, this research collects the data searched with keywords through the Spring Framework using next generation web standards and through Web scraping based on the Hadoop 2.0 Ecosystem, loads the collected data on to a Hadoop Distributed File System (HDFS) and HBase, and designs and implements a Big Data utilization system that can schematize, through a word cloud, the results of analysis of keyword, title, contents and morpheme on the basis of contents and nouns extracted from the loaded data with a Twitter morpheme analyzer. Improvements/Applications: This research intends to provide a platform reference model that is applicable to enterprise groupware to which the Distributed Hadoop Ecosystem and the Spring Framework under next generation web standards are applied. © 2019 SERSC.	-
dc.language	영어	-
dc.language.iso	en	-
dc.publisher	Science and Engineering Research Support Society	-
dc.relation.isPartOf	International Journal of Advanced Science and Technology	-
dc.title	Design and implementation of web scraping platform using spring framework based on distributed hadoop ecosystem	-
dc.type	Article	-
dc.type.rims	ART	-
dc.description.journalClass	1	-
dc.identifier.bibliographicCitation	International Journal of Advanced Science and Technology, v.28, no.5, pp.174 - 182	-
dc.description.isOpenAccess	N	-
dc.identifier.scopusid	2-s2.0-85080111796	-
dc.citation.endPage	182	-
dc.citation.startPage	174	-
dc.citation.title	International Journal of Advanced Science and Technology	-
dc.citation.volume	28	-
dc.citation.number	5	-
dc.contributor.affiliatedAuthor	Yoo J.-S.	-
dc.type.docType	Article	-
dc.subject.keywordAuthor	Big Data	-
dc.subject.keywordAuthor	Distributed Hadoop Ecosystem	-
dc.subject.keywordAuthor	HBase	-
dc.subject.keywordAuthor	HDFS	-
dc.subject.keywordAuthor	Spring Framework	-
dc.subject.keywordAuthor	Web Scraping	-
dc.description.journalRegisteredClass	scopus	-

Files in This Item: There are no files associated with this item.

Appears in Collections: 공과대학 > 산업경영공학과 > 1. Journal Articles

Show simple item record

qrcode

Related Researcher

Researcher Yoo, Jung Sang photo

Yoo, Jung Sang: Engineering (Department of Industrial Engineering)

Read more

Altmetrics

Total Views & Downloads

STATISTICS: Total View :4,202,560; Today View :7,195

RSS_1.0 RSS_2.0 ATOM_1.0

1342, Seongnam-daero, Sujeong-gu, Seongnam-si, Gyeonggi-do, Republic of Korea(13120)031-750-5114

Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.

Detailed Information

Related Researcher

Altmetrics

Total Views & Downloads

BROWSE