Cited 0 time in
Efficient link-based clustering in a large scaled blog network
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Yoon, Seok-Ho | - |
| dc.contributor.author | Song, Suk-Soon | - |
| dc.contributor.author | Kim, Sang-Wook | - |
| dc.date.accessioned | 2022-07-16T22:00:08Z | - |
| dc.date.available | 2022-07-16T22:00:08Z | - |
| dc.date.created | 2021-05-11 | - |
| dc.date.issued | 2011-02 | - |
| dc.identifier.issn | 0000-0000 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/169150 | - |
| dc.description.abstract | In this paper, we address efficient processing of link-based clustering in large-scaled data environment. LinkClus is a link-based clustering method that provides good accuracy and reasonable performance. This paper first shows that this method is not sufficiently scalable to be applied to a huge volume of real-world blog data. Then, we observe that the performance bottleneck of LinkClus exists on the initial clustering step. We propose a new method to get over this performance bottleneck. The proposed method first identifies the seed sets for initial clustering efficiently. Here, each seed set consists of a small number (=2∼3) of objects that are highly similar to one another. The method then adds every other object into one of seed sets that are the most similar to the object. It also eliminates those objects of very few links that negatively affect the accuracy, thereby enhancing the overall processing performance. Via experiments with real-world blog data, we verify the scalability and accuracy of the proposed method. | - |
| dc.language | 영어 | - |
| dc.language.iso | en | - |
| dc.publisher | Association for Computing Machinery | - |
| dc.title | Efficient link-based clustering in a large scaled blog network | - |
| dc.type | Article | - |
| dc.contributor.affiliatedAuthor | Kim, Sang-Wook | - |
| dc.identifier.doi | 10.1145/1968613.1968699 | - |
| dc.identifier.scopusid | 2-s2.0-79956044902 | - |
| dc.identifier.bibliographicCitation | Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2011, pp.1 - 5 | - |
| dc.relation.isPartOf | Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2011 | - |
| dc.citation.title | Proceedings of the 5th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2011 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 5 | - |
| dc.type.rims | ART | - |
| dc.type.docType | Conference Paper | - |
| dc.description.journalClass | 1 | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.subject.keywordPlus | Blog data | - |
| dc.subject.keywordPlus | Blog networks | - |
| dc.subject.keywordPlus | Clustering methods | - |
| dc.subject.keywordPlus | Data environment | - |
| dc.subject.keywordPlus | Link-based clustering | - |
| dc.subject.keywordPlus | LinkClus | - |
| dc.subject.keywordPlus | Performance bottlenecks | - |
| dc.subject.keywordPlus | Processing performance | - |
| dc.subject.keywordPlus | Seed set | - |
| dc.subject.keywordPlus | Blogs | - |
| dc.subject.keywordPlus | Seed | - |
| dc.subject.keywordPlus | Information management | - |
| dc.subject.keywordAuthor | Blog data | - |
| dc.subject.keywordAuthor | Link-based clustering | - |
| dc.subject.keywordAuthor | LinkClus | - |
| dc.identifier.url | https://dl.acm.org/doi/10.1145/1968613.1968699 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
