A Study on the Availability of Survival Analysis of Lung Cancer Patients Using Synthetic Data
DC Field | Value | Language |
---|---|---|
dc.contributor.author | 유제형 | - |
dc.contributor.author | 이승희 | - |
dc.contributor.author | 김종엽 | - |
dc.contributor.author | 손지웅 | - |
dc.contributor.author | 구관우 | - |
dc.contributor.author | 이수현 | - |
dc.date.accessioned | 2023-06-17T07:40:29Z | - |
dc.date.available | 2023-06-17T07:40:29Z | - |
dc.date.created | 2023-06-17 | - |
dc.date.issued | 2022-11 | - |
dc.identifier.issn | 2465-8014 | - |
dc.identifier.uri | https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/88117 | - |
dc.description.abstract | Objectives: This pilot study investigated whether synthetic data can support clinical analysis when real data provide too small a sample. Because real data are constrained by ethical concerns and cost, there have been many attempts to generate realistic synthetic data. The focus here is whether synthetic data can be used in place of real data. Methods: In this quasi-experimental study, synthetic data on 11,978 lung cancer patients who received anticancer drug therapy were analyzed. Clinically significant variables were extracted, and several tables containing patient status and treatment records were preprocessed. Propensity score matching was applied to reduce bias from covariates. The preprocessed data were then analyzed using Kaplan-Meier estimation and the Cox proportional hazards model. Results: The survival curves plotted from the synthetic data did not match the curves from the actual data for the other covariates. In Cohort 1, Gen I had a better 5-year OS than Gen II [S1 = 0.973, S2 = 0.953, p < 0.05]. Similarly, in Cohort 2, Gen I anticancer therapy performed better than Gen III [S1 = 0.990, S3 = 0.884, p < 0.05]. In the exploratory subgroup analysis, the hazard ratio (HR) was estimated using the Cox regression model, and Gen I showed a more favorable HR than Gen II and Gen III. However, these results differed from the actual trend. Conclusions: The analysis based on the DATA-FREE-BOX data differed from the trend of the survival analysis conducted with real data, so trends obtained from these synthetic data may not reflect the real trend. This finding can contribute to data validation. Moreover, the methodology used in this study is expected to be applicable to clinical studies based on actual data. | - |
dc.language | Korean | - |
dc.language.iso | ko | - |
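dc.publisher | 한국보건정보통계학회 (Korean Society of Health Informatics and Statistics) | - |
dc.relation.isPartOf | 보건정보통계학회지 (Journal of Health Informatics and Statistics) | - |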
dc.title | 합성 데이터를 활용한 폐암 환자의 생존분석 가능성 검정 | - |
dc.title.alternative | A Study on the Availability of Survival Analysis of Lung Cancer Patients Using Synthetic Data | - |
dc.type | Article | - |
dc.type.rims | ART | - |
dc.description.journalClass | 2 | - |
dc.identifier.doi | 10.21032/jhis.2022.47.4.279 | - |
dc.identifier.bibliographicCitation | 보건정보통계학회지 (Journal of Health Informatics and Statistics), v.47, no.4, pp.279-289 | - |
dc.identifier.kciid | ART002903231 | - |
dc.description.isOpenAccess | Y | - |
dc.citation.endPage | 289 | - |
dc.citation.startPage | 279 | - |
dc.citation.title | 보건정보통계학회지 (Journal of Health Informatics and Statistics) | - |
dc.citation.volume | 47 | - |
dc.citation.number | 4 | - |
dc.contributor.affiliatedAuthor | 이수현 | - |
dc.subject.keywordAuthor | Survival analysis | - |
dc.subject.keywordAuthor | Synthetic data | - |
dc.subject.keywordAuthor | Kaplan-Meier estimation | - |
dc.subject.keywordAuthor | Cox regression model | - |
dc.subject.keywordAuthor | Lung cancer | - |
dc.description.journalRegisteredClass | kci | - |
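
The abstract names Kaplan-Meier estimation as one of the two analysis methods. As a rough illustration only, the product-limit estimator can be sketched in plain Python as below. This is not the authors' code: the function name `kaplan_meier` and the toy follow-up times are hypothetical and are not taken from the study or from the DATA-FREE-BOX data.

```python
from collections import Counter


def kaplan_meier(durations, events):
    """Product-limit estimate of survival at each distinct event time.

    durations: follow-up time for each patient (hypothetical toy values).
    events:    1 if death was observed at that time, 0 if censored.
    Returns a list of (time, survival_probability) pairs.
    """
    if len(durations) != len(events):
        raise ValueError("durations and events must have the same length")

    # Deaths per time point, and all exits (deaths + censorings) per time point.
    deaths = Counter(t for t, e in zip(durations, events) if e)
    exits = Counter(durations)

    at_risk = len(durations)
    survival = 1.0
    curve = []
    for t in sorted(exits):
        d = deaths.get(t, 0)
        if d:
            # S(t) is multiplied by (1 - d_i / n_i) at each event time.
            survival *= 1.0 - d / at_risk
            curve.append((t, survival))
        # Everyone who died or was censored at t leaves the risk set.
        at_risk -= exits[t]
    return curve


# Toy example: six patients, two of them censored (event = 0).
toy_curve = kaplan_meier([1, 2, 2, 3, 4, 5], [1, 1, 0, 1, 0, 1])
for time, prob in toy_curve:
    print(f"t={time}: S(t)={prob:.3f}")
```

Censored patients reduce the risk set without lowering the curve, which is why the estimate only steps down at times with an observed death. A real analysis would more likely rely on an established survival-analysis library and would also report confidence intervals and a log-rank test.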