Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

C-Affinity: A Novel Similarity Measure for Effective Data Clustering

Full metadata record
DC FieldValueLanguage
dc.contributor.authorHong, Jiwon-
dc.contributor.authorKim, Sang-Wook-
dc.date.accessioned2023-06-01T07:00:18Z-
dc.date.available2023-06-01T07:00:18Z-
dc.date.issued2023-04-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/185837-
dc.description.abstractClustering is widely employed in various applications as it is one of the most useful data mining techniques. In performing clustering, a similarity measure, which defines how similar a pair of data objects are, plays an important role. A similarity measure is employed by considering a target dataset's characteristics. Current similarity measures (or distances) do not reflect the distribution of data objects in a dataset at all. From the clustering point of view, this fact may limit the clustering accuracy. In this paper, we propose c-affinity, a new notion of a similarity measure that reflects the distribution of objects in the given dataset from a clustering point of view. We design c-affinity between any two objects to have a higher value as they are more likely to belong to the same cluster by learning the data distribution. We use random walk with restart (RWR) on the k-nearest neighbor graph of the given dataset to measure (1) how similar a pair of objects are and (2) how densely other objects are distributed between them. Via extensive experiments on sixteen synthetic and real-world datasets, we verify that replacing the existing similarity measure with our c-affinity improves the clustering accuracy significantly.-
dc.format.extent4-
dc.language영어-
dc.language.isoENG-
dc.publisherAssociation for Computing Machinery, Inc-
dc.titleC-Affinity: A Novel Similarity Measure for Effective Data Clustering-
dc.typeArticle-
dc.publisher.location미국-
dc.identifier.doi10.1145/3543873.3587307-
dc.identifier.scopusid2-s2.0-85159592684-
dc.identifier.bibliographicCitationACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023, pp 41 - 44-
dc.citation.titleACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023-
dc.citation.startPage41-
dc.citation.endPage44-
dc.type.docTypeConference paper-
dc.description.isOpenAccessN-
dc.description.journalRegisteredClassscopus-
dc.subject.keywordPlusC (programming language)-
dc.subject.keywordPlusCluster analysis-
dc.subject.keywordPlusClustering algorithms-
dc.subject.keywordPlusData mining-
dc.subject.keywordPlusNearest neighbor search-
dc.subject.keywordPlusClustering accuracy-
dc.subject.keywordPlusClustering affinity-
dc.subject.keywordPlusClusterings-
dc.subject.keywordPlusData clustering-
dc.subject.keywordPlusData objects-
dc.subject.keywordPlusNear neighbor graph-
dc.subject.keywordPlusNearest-neighbour-
dc.subject.keywordPlusNeighbor graph-
dc.subject.keywordPlusSimilarity measure-
dc.subject.keywordAuthorclustering-
dc.subject.keywordAuthorclustering affinity-
dc.subject.keywordAuthornearest neighbor graph-
dc.subject.keywordAuthorsimilarity measure-
dc.identifier.urlhttps://dl.acm.org/doi/10.1145/3543873.3587307-
Files in This Item
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Sang-Wook photo

Kim, Sang-Wook
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE