A literature-driven method to calculate similarities among diseases
- Authors
- Kim, Hyunjin; Yoon, Youngmi; Ahn, Jaegyoon; Park, Sanghyun
- Issue Date
- Nov-2015
- Publisher
- ELSEVIER IRELAND LTD
- Keywords
- Disease network; Disease-disease similarity; Biomedical text mining
- Citation
- COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, v.122, no.2, pp.108 - 122
- Journal Title
- COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE
- Volume
- 122
- Number
- 2
- Start Page
- 108
- End Page
- 122
- URI
- https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/9982
- DOI
- 10.1016/j.cmpb.2015.07.001
- ISSN
- 0169-2607
- Abstract
- Background: "Our lives are connected by a thousand invisible threads and along these sympathetic fibers, our actions run as causes and return to us as results." This famous quote, attributed to Herman Melville, describes the connections among human lives. To paraphrase it, diseases are connected by many functional threads, and along these sympathetic fibers, diseases run as causes and return as results. The quote explains the motivation for researching disease-disease similarity and disease networks. Measuring similarities between diseases and constructing a disease network can play an important role in disease function research and in disease treatment. To estimate disease-disease similarities, we proposed a novel literature-based method. Methods and results: The proposed method extracted disease-gene relations and disease-drug relations from the literature and used the frequencies of occurrence of these relations as features to calculate similarities among diseases. We also constructed a disease network from the top-ranking disease pairs produced by our method. The proposed method discovered a larger number of answer disease pairs than other comparable methods and showed the lowest p-value. Conclusions: We presume that our method showed good results because it uses literature data, uses all possible gene symbols and drug names as features of a disease, and determines the feature values of diseases from the frequencies of co-occurrence of two entities. The disease-disease similarities from the proposed method can be used in computational biology research that relies on similarities among diseases. © 2015 Elsevier Ireland Ltd. All rights reserved.
- Appears in Collections
- College of IT Convergence > Department of Computer Engineering > 1. Journal Articles
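
The approach summarized in the abstract (co-occurrence frequencies of disease-gene and disease-drug relations used as features, then a similarity score per disease pair) can be illustrated with a minimal sketch. The abstract does not state which similarity measure the authors used, so cosine similarity is an assumption here, and the disease names, feature names, and counts are hypothetical placeholders rather than data from the paper.

```python
from collections import Counter
from itertools import combinations
from math import sqrt

# Hypothetical co-occurrence counts: disease -> {gene symbol or drug name: frequency}.
# These values are invented for illustration and do not come from the paper.
cooccurrence = {
    "disease_A": Counter({"GENE1": 4, "GENE2": 1, "drug_x": 3}),
    "disease_B": Counter({"GENE1": 2, "drug_x": 5, "drug_y": 1}),
    "disease_C": Counter({"GENE3": 6, "drug_z": 2}),
}


def cosine_similarity(a, b):
    """Cosine similarity of two sparse frequency vectors (assumed measure)."""
    shared = set(a) & set(b)
    dot = sum(a[f] * b[f] for f in shared)
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_disease_pairs(features):
    """Score every unordered disease pair and sort by descending similarity."""
    scored = [
        (cosine_similarity(features[d1], features[d2]), d1, d2)
        for d1, d2 in combinations(sorted(features), 2)
    ]
    return sorted(scored, reverse=True)


ranked = rank_disease_pairs(cooccurrence)
for score, d1, d2 in ranked:
    print(f"{d1} - {d2}: {score:.3f}")
```

Top-ranking pairs from such a list could then serve as the edges of a disease network, as the abstract describes; the cutoff for "top-ranking" is not given in the abstract and would have to be chosen separately.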