On exploiting content and citations together to compute similarity of scientific papers
- Authors
- Hamedani, Masoud Reyhani; Kim, Sang-Wook; Lee, Sang-Chul; Kim, Dong-Jin
- Issue Date
- Oct-2013
- Publisher
- Association for Computing Machinary, Inc.
- Keywords
- Authority; Citation; Content; Scientific papers; Similarity
- Citation
- International Conference on Information and Knowledge Management, Proceedings, pp.1553 - 1556
- Indexed
- SCOPUS
- Journal Title
- International Conference on Information and Knowledge Management, Proceedings
- Start Page
- 1553
- End Page
- 1556
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/161771
- DOI
- 10.1145/2505515.2507842
- ISSN
- 0000-0000
- Abstract
- In computing the similarity of scientific papers, previous text-based and link-based similarity measures look at only a single side of the content and citations. In this paper, we propose a novel approach called SimCC that effectively combines the content and citation information to accurately compute the similarity of scientific papers. Unlike previous approaches, SimCC effectively represents both authority and context of a scientific paper simultaneously in computing similarities. Also, we propose SimCC+A to consider recently-published papers. The effectiveness of our proposed method is demonstrated via extensive experiments on a real-world dataset of scientific papers, with more than 100% improvement in accuracy compared with previous methods.
- Files in This Item
-
Go to Link
- Appears in
Collections - 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.