Fast clustering algorithm for information organization
- Authors
- Shin, K.; Han, S.Y.
- Issue Date
- Feb-2003
- Publisher
- SPRINGER-VERLAG BERLIN
- Citation
- COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS, v.2588, pp 619 - 622
- Pages
- 4
- Journal Title
- COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, PROCEEDINGS
- Volume
- 2588
- Start Page
- 619
- End Page
- 622
- URI
- https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/65767
- DOI
- 10.1007/3-540-36456-0_69
- ISSN
- 0302-9743
- Abstract
- This study deals with information organization for more efficient Internet document search and browsing results. As the appropriate algorithm for this purpose, this study proposes the heuristic algorithm, which functions similarly with the star clustering algorithm but performs a more efficient time complexity of O(kn),(k<<n) instead of O(n(2)) found in the star clustering algorithm. The proposed heuristic algorithm applies the cosine similarity and sets vectors composed of the most non-zero elements as the initial standard value. The algorithm is purported to execute the clustering procedure based on the concept vector and produce clusters for information organization in 0(kn) period of time. In order to see how fast the proposed algorithm is in producing clusters for organizing information, the algorithm is tested on TIME and CLASSIC3 in comparison with the star clustering algorithm.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - College of Software > School of Computer Science and Engineering > 1. Journal Articles
![qrcode](https://api.qrserver.com/v1/create-qr-code/?size=55x55&data=https://scholarworks.bwise.kr/cau/handle/2019.sw.cau/65767)
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.