Detailed Information

Cited 0 time in webofscience Cited 0 time in scopus
Metadata Downloads

Semi-Parallel logistic regression for GWAS on encrypted data

Full metadata record
DC Field Value Language
dc.contributor.authorKim, Miran-
dc.contributor.authorSong, Yongsoo-
dc.contributor.authorLi, Baiyu-
dc.contributor.authorMicciancio, Daniele-
dc.date.accessioned2023-09-04T08:03:55Z-
dc.date.available2023-09-04T08:03:55Z-
dc.date.created2023-07-19-
dc.date.issued2020-07-
dc.identifier.issn17558794-
dc.identifier.urihttps://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/190064-
dc.description.abstractBackground The sharing of biomedical data is crucial to enable scientific discoveries across institutions and improve health care. For example, genome-wide association studies (GWAS) based on a large number of samples can identify disease-causing genetic variants. The privacy concern, however, has become a major hurdle for data management and utilization. Homomorphic encryption is one of the most powerful cryptographic primitives which can address the privacy and security issues. It supports the computation on encrypted data, so that we can aggregate data and perform an arbitrary computation on an untrusted cloud environment without the leakage of sensitive information. Methods This paper presents a secure outsourcing solution to assess logistic regression models for quantitative traits to test their associations with genotypes. We adapt the semi-parallel training method by Sikorska et al., which builds a logistic regression model for covariates, followed by one-step parallelizable regressions on all individual single nucleotide polymorphisms (SNPs). In addition, we modify our underlying approximate homomorphic encryption scheme for performance improvement. Results We evaluated the performance of our solution through experiments on real-world dataset. It achieves the best performance of homomorphic encryption system for GWAS analysis in terms of both complexity and accuracy. For example, given a dataset consisting of 245 samples, each of which has 10643 SNPs and 3 covariates, our algorithm takes about 43 seconds to perform logistic regression based genome wide association analysis over encryption. Conclusions We demonstrate the feasibility and scalability of our solution.-
dc.language영어-
dc.language.isoen-
dc.publisherBMC-
dc.titleSemi-Parallel logistic regression for GWAS on encrypted data-
dc.typeArticle-
dc.contributor.affiliatedAuthorKim, Miran-
dc.identifier.doi10.1186/s12920-020-0724-z-
dc.identifier.scopusid2-s2.0-85088509481-
dc.identifier.wosid000553597700008-
dc.identifier.bibliographicCitationBMC MEDICAL GENOMICS, v.13, pp.1 - 13-
dc.relation.isPartOfBMC MEDICAL GENOMICS-
dc.citation.titleBMC MEDICAL GENOMICS-
dc.citation.volume13-
dc.citation.startPage1-
dc.citation.endPage13-
dc.type.rimsART-
dc.type.docType정기학술지(Article(Perspective Article포함))-
dc.description.journalClass1-
dc.description.isOpenAccessY-
dc.description.journalRegisteredClassscie-
dc.description.journalRegisteredClassscopus-
dc.relation.journalResearchAreaGenetics & Heredity-
dc.relation.journalWebOfScienceCategoryGenetics & Heredity-
dc.subject.keywordPlusSEARCH-AND-COMPUTE-
dc.subject.keywordAuthorGenome-wide association studies-
dc.subject.keywordAuthorHomomorphic encryption-
dc.subject.keywordAuthorLogistic regression-
dc.identifier.urlhttps://bmcmedgenomics.biomedcentral.com/articles/10.1186/s12920-020-0724-z-
Files in This Item
Appears in
Collections
서울 자연과학대학 > 서울 수학과 > 1. Journal Articles

qrcode

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher Kim, Miran photo

Kim, Miran
COLLEGE OF NATURAL SCIENCES (DEPARTMENT OF MATHEMATICS)
Read more

Altmetrics

Total Views & Downloads

BROWSE