Cited 0 time in
Computing paper similarity based on Latent Dirichlet Allocation
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Bae, Duck-Ho | - |
| dc.contributor.author | Yoon, Seok-Ho | - |
| dc.contributor.author | Eom, Tae-Hwan | - |
| dc.contributor.author | Ha, Jiwoon | - |
| dc.contributor.author | Hwang, Young-Sup | - |
| dc.contributor.author | Kim, Sang-Wook | - |
| dc.date.accessioned | 2022-07-16T06:21:45Z | - |
| dc.date.available | 2022-07-16T06:21:45Z | - |
| dc.date.created | 2021-05-13 | - |
| dc.date.issued | 2014-01 | - |
| dc.identifier.issn | 0000-0000 | - |
| dc.identifier.uri | https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/160856 | - |
| dc.description.abstract | This paper discusses methods to compute paper similarity accu- rately using Latent Dirichlet Allocation (LDA). The problems oc- curring when we compute paper similarity based on LDA are as follows. At first, paper similarity in a paper database is hard to be calculated accurately because they are too deficient in text infor- mation, which is caused by the copyright problem and the technical limits of crawling and parsing. Secondly, it is hard to provide the inputs necessary to compute similarity based on LDA. To compute LDA-based similarity, a user should input the topic number and de- termine seed papers as many as the topic number. This paper pro- poses the following methods to solve these two problems. To solve the deficiency of text, we apply the keyword extension method to compute LDA-based similarity. The keyword extension method uses the text referred by the compared paper or text in papers refer- ring the compared paper as text information. To select appropriate seed papers, we propose a method to utilize reference information of the paper compared. Finally, we demonstrate the superiority of the proposed method by experimenting on real papers. | - |
| dc.language | 영어 | - |
| dc.language.iso | en | - |
| dc.publisher | Association for Computing Machinery | - |
| dc.title | Computing paper similarity based on Latent Dirichlet Allocation | - |
| dc.type | Article | - |
| dc.contributor.affiliatedAuthor | Kim, Sang-Wook | - |
| dc.identifier.doi | 10.1145/2557977.2558028 | - |
| dc.identifier.scopusid | 2-s2.0-84899729281 | - |
| dc.identifier.bibliographicCitation | Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2014, pp.1 - 6 | - |
| dc.relation.isPartOf | Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2014 | - |
| dc.citation.title | Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication, ICUIMC 2014 | - |
| dc.citation.startPage | 1 | - |
| dc.citation.endPage | 6 | - |
| dc.type.rims | ART | - |
| dc.type.docType | Conference Paper | - |
| dc.description.journalClass | 1 | - |
| dc.description.isOpenAccess | N | - |
| dc.description.journalRegisteredClass | scopus | - |
| dc.subject.keywordPlus | Communication | - |
| dc.subject.keywordPlus | Information management | - |
| dc.subject.keywordPlus | Statistics | - |
| dc.subject.keywordPlus | Extension methods | - |
| dc.subject.keywordPlus | Keyword extension | - |
| dc.subject.keywordPlus | Latent Dirichlet allocation | - |
| dc.subject.keywordPlus | Latent dirichlet allocations | - |
| dc.subject.keywordPlus | LDA | - |
| dc.subject.keywordPlus | Technical limits | - |
| dc.subject.keywordPlus | Text information | - |
| dc.subject.keywordPlus | Text-based similarity | - |
| dc.subject.keywordPlus | Problem solving | - |
| dc.subject.keywordAuthor | Keyword extension | - |
| dc.subject.keywordAuthor | LDA | - |
| dc.subject.keywordAuthor | Paper database | - |
| dc.subject.keywordAuthor | Text-based similarity | - |
| dc.identifier.url | https://dl.acm.org/doi/10.1145/2557977.2558028 | - |
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.
222, Wangsimni-ro, Seongdong-gu, Seoul, 04763, Korea+82-2-2220-1366
COPYRIGHT © 2024 HANYANG UNIVERSITY.
Certain data included herein are derived from the © Web of Science of Clarivate Analytics. All rights reserved.
You may not copy or re-distribute this material in whole or in part without the prior written consent of Clarivate Analytics.
