Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

DBCURE-MR: An efficient density-based clustering algorithm for large data using MapReduce

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Younghoon-
dc.contributor.authorShim, Kyuseok-
dc.contributor.authorKim, Min-Soeng-
dc.contributor.authorLee, June Sup-
dc.date.accessioned2021-06-22T23:22:46Z-
dc.date.available2021-06-22T23:22:46Z-
dc.date.issued2014-06-
dc.identifier.issn0306-4379-
dc.identifier.issn1873-6076-
dc.identifier.urihttps://scholarworks.bwise.kr/erica/handle/2021.sw.erica/22821-
dc.description.abstractClustering is a useful data mining technique which groups data points such that the points within a single group have similar characteristics, while the points in different groups are dissimilar. Density-based clustering algorithms such as DBSCAN and OPTICS are one kind of widely used clustering algorithms. As there is an increasing trend of applications to deal with vast amounts of data, clustering such big data is a challenging problem. Recently, parallelizing clustering algorithms on a large cluster of commodity machines using the MapReduce framework have received a lot of attention. In this paper, we first propose the new density-based clustering algorithm, called DBCURE, which is robust to find clusters with varying densities and suitable for parallelizing the algorithm with MapReduce. We next develop DBCURE-MR, which is a parallelized DBCURE using MapReduce. While traditional density-based algorithms find each cluster one by one, our DBCURE-MR finds several clusters together in parallel. We prove that both DBCURE and DBCURE-MR find the clusters correctly based on the definition of density-based clusters. Our experimental results with various data sets confirm that DBCURE-MR finds clusters efficiently without being sensitive to the clusters with varying densities and scales up well with the MapReduce framework. (C) 2013 Published by Elsevier Ltd.-
dc.format.extent21-
dc.language영어-
dc.language.isoENG-
dc.publisherElsevier Science & Technology-
dc.titleDBCURE-MR: An efficient density-based clustering algorithm for large data using MapReduce-
dc.typeArticle-
dc.publisher.location영국-
dc.identifier.doi10.1016/j.is.2013.11.002-
dc.identifier.scopusid2-s2.0-84896527723-
dc.identifier.wosid000333785700002-
dc.identifier.bibliographicCitationInformation Systems, v.42, pp 15 - 35-
dc.citation.titleInformation Systems-
dc.citation.volume42-
dc.citation.startPage15-
dc.citation.endPage35-
dc.type.docTypeArticle-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClasssci-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaComputer Science-
dc.relation.journalWebOfScienceCategoryComputer Science, Information Systems-
dc.subject.keywordAuthorClustering algorithm-
dc.subject.keywordAuthorDensity-based clustering-
dc.subject.keywordAuthorParallel algorithm-
dc.subject.keywordAuthorMapReduce-
dc.identifier.urlhttps://www.sciencedirect.com/science/article/pii/S0306437913001634?via%3Dihub-
Files in This Item
Go to Link
Appears in
Collections
COLLEGE OF COMPUTING > DEPARTMENT OF ARTIFICIAL INTELLIGENCE > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Young hoon photo

Kim, Young hoon
COLLEGE OF COMPUTING (DEPARTMENT OF ARTIFICIAL INTELLIGENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE