Cited 0 time in
SimCC: A novel method to consider both content and citations for computing similarity of scientific papers
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Hamedani, Masoud Reyhani | - |
| dc.contributor.author | Kim, Sang-Wook | - |
| dc.contributor.author | Kim, Dong-Jin | - |
| dc.date.accessioned | 2022-07-15T18:16:23Z | - |
| dc.date.available | 2022-07-15T18:16:23Z | - |
| dc.date.issued | 2016-03 | - |
| dc.identifier.issn | 0020-0255 | - |
| dc.identifier.issn | 1872-6291 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/155042 | - |
| dc.description.abstract | To compute the similarity of scientific papers, text-based similarity measures, link-based similarity measures, and hybrid methods can be applied. The text-based and link-based similarity measures take into account only a single aspect of scientific papers, content or citations, respectively. The hybrid methods consider both content and citations; however, they do not carefully consider the relation between the content of a pair of papers involved in a citation relationship. In this paper, we propose a novel method, SimCC (similarity based on content and citations), that considers both aspects, content and citations, to compute the similarity of scientific papers. Unlike previous methods, SimCC effectively reflects both content and authority of scientific papers simultaneously in similarity computation by applying a new RA (relevance and authority) weighting scheme. Also, we propose an RA+R weighting scheme to consider the recency of papers and an RA+E weighting scheme to take into account the author expertise of papers in similarity computation. The effectiveness of our proposed method is demonstrated by extensive experiments on a real-world dataset of scientific papers. The results show that our method achieves more than 100% improvement in accuracy in comparison with previous methods. | - |
| dc.format.extent | 20 | - |
| dc.language | 영어 | - |
| dc.language.iso | ENG | - |
| dc.publisher | Elsevier BV | - |
| dc.title | SimCC: A novel method to consider both content and citations for computing similarity of scientific papers | - |
| dc.type | Article | - |
| dc.publisher.location | 미국 | - |
| dc.identifier.doi | 10.1016/j.ins.2015.12.001 | - |
| dc.identifier.scopusid | 2-s2.0-84959419098 | - |
| dc.identifier.wosid | 000370088500014 | - |
| dc.identifier.bibliographicCitation | Information Sciences, v.334, pp 273 - 292 | - |
| dc.citation.title | Information Sciences | - |
| dc.citation.volume | 334 | - |
| dc.citation.startPage | 273 | - |
| dc.citation.endPage | 292 | - |
| dc.type.docType | Article | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | sci | - |
| dc.description.journalRegisteredClass | scie | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.relation.journalResearchArea | Computer Science | - |
| dc.relation.journalWebOfScienceCategory | Computer Science, Information Systems | - |
| dc.subject.keywordPlus | INFORMATION | - |
| dc.subject.keywordAuthor | Citations | - |
| dc.subject.keywordAuthor | Content | - |
| dc.subject.keywordAuthor | Contribution score | - |
| dc.subject.keywordAuthor | Scientific papers | - |
| dc.subject.keywordAuthor | Similarity | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
