Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

C-Rank: A link-based similarity measure for scientific literature databases

Authors
Yoon, Seok-HoKim, Sang-WookPark, Sunju
Issue Date
Jan-2016
Publisher
ELSEVIER SCIENCE INC
Keywords
Scientific literature; Link-based similarity measure
Citation
INFORMATION SCIENCES, v.326, pp.25 - 40
Indexed
SCIE
SCOPUS
Journal Title
INFORMATION SCIENCES
Volume
326
Start Page
25
End Page
40
URI
https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/155299
DOI
10.1016/j.ins.2015.07.036
ISSN
0020-0255
Abstract
As the number of people who use scientific literature databases has grown, the demand for literature retrieval services has steadily increased. One of the most popular retrieval service methods is to find a set of papers similar to the paper under consideration, which requires a measure that computes the similarities between the papers. Scientific literature databases exhibit two interesting characteristics that are not found in general databases. First, the papers cited by older papers are often not included in the database due to technical and economic reasons. Second, since a paper references previously published papers, few papers cite recently published papers. These two characteristics cause all existing similarity measures to fail in at least one of the following cases: (1) measuring the similarity between old, but similar papers, (2) measuring the similarity between recent, but similar papers, and (3) measuring the similarity between two similar papers: one old, the other recent. In this paper, we propose a new link-based similarity measure called C-Rank, which uses both in-link and out-link references, disregarding the direction of the references. In addition, we discuss the most suitable normalization method for scientific literature databases and we propose an evaluation method for measuring the accuracy of similarity measures. For the experiments, we used real-world papers from DBLP's database with reference information crawled from Libra. We then compared the performance of C-Rank with that of existing similarity measures. Experimental results showed that C-Rank achieved a higher accuracy than existing similarity measures.
Files in This Item
Go to Link
Appears in
Collections
서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Sang-Wook photo

Kim, Sang-Wook
COLLEGE OF ENGINEERING (SCHOOL OF COMPUTER SCIENCE)
Read more

Altmetrics

Total Views & Downloads

BROWSE