RNA Sequences-Based Diagnosis of Parkinson's Disease Using Various Feature Selection Methods and Machine Learning (open access)

Authors
Kim, Jingeun; Park, Hye-Jin; Yoon, Yourim
Issue Date
Feb-2023
Publisher
MDPI
Keywords
Parkinson's disease; RNA sequences; genetic algorithm; information gain; wolf search algorithm; extreme gradient boosting; deep neural network; support vector machine; decision tree
Citation
APPLIED SCIENCES-BASEL, v.13, no.4
Journal Title
APPLIED SCIENCES-BASEL
Volume
13
Number
4
URI
https://scholarworks.bwise.kr/gachon/handle/2020.sw.gachon/87836
DOI
10.3390/app13042698
ISSN
2076-3417
Abstract
Parkinson's disease is a neurodegenerative disease that is associated with genetic and environmental factors. However, the genes causing this degeneration have not been determined, and no reported cure exists for this disease. Recently, studies have been conducted to classify diseases with RNA-seq data using machine learning, and accurate diagnosis of diseases using machine learning is becoming an important task. In this study, we focus on how various feature selection methods can improve the performance of machine learning for accurate diagnosis of Parkinson's disease. In addition, we analyze the performance metrics and computational costs of running the model with and without various feature selection methods. Experiments were conducted using RNA sequencing, a technique that analyzes the transcription profiles of organisms using next-generation sequencing. Genetic algorithm (GA), information gain (IG), and wolf search algorithm (WSA) were employed as feature selection methods. Four machine learning algorithms, namely extreme gradient boosting (XGBoost), deep neural network (DNN), support vector machine (SVM), and decision tree (DT), were used as classifiers. Further, the model was evaluated using performance indicators such as accuracy, precision, recall, F1 score, and the receiver operating characteristic (ROC) curve. For XGBoost and DNN, feature selection methods based on GA, IG, and WSA improved the performance of machine learning by 10.00% and 38.18%, respectively. For SVM and DT, performance was improved by 0.91% and 7.27%, respectively, with feature selection methods based on IG and WSA. The results demonstrate that various feature selection methods improve the performance of machine learning when classifying Parkinson's disease using RNA-seq data.
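As an illustration of one of the feature selection methods named in the abstract, the sketch below ranks features by information gain (IG) and keeps the highest-scoring ones. It is not the authors' implementation: the abstract does not say how expression values were discretized, so the median split used here is an assumption, and the function names (`entropy`, `information_gain`, `rank_features`) and the toy data are hypothetical and do not come from the study.

```python
import math
import statistics
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(feature_values, labels, threshold):
    """Entropy reduction from splitting the labels on value > threshold."""
    high = [y for x, y in zip(feature_values, labels) if x > threshold]
    low = [y for x, y in zip(feature_values, labels) if x <= threshold]
    total = len(labels)
    # Weighted entropy of the two partitions; empty partitions are skipped.
    conditional = sum(
        len(part) / total * entropy(part) for part in (high, low) if part
    )
    return entropy(labels) - conditional


def rank_features(samples, labels, top_k):
    """Return indices of the top_k features by IG.

    Assumption: each feature is binarized at its median value.
    """
    scores = []
    for j in range(len(samples[0])):
        column = [row[j] for row in samples]
        threshold = statistics.median(column)
        scores.append((information_gain(column, labels, threshold), j))
    # Highest gain first; ties are broken by the lower feature index.
    scores.sort(key=lambda s: (-s[0], s[1]))
    return [j for _, j in scores[:top_k]]


# Toy data (made up): 6 samples x 3 features, label 1 = patient, 0 = control.
samples = [
    [9, 5, 6],
    [8, 1, 7],
    [7, 4, 2],
    [1, 5, 1],
    [2, 1, 8],
    [3, 4, 3],
]
labels = [1, 1, 1, 0, 0, 0]

print(rank_features(samples, labels, top_k=2))  # prints [0, 2]
```

In this toy example, feature 0 separates the two classes perfectly and therefore receives the maximum gain of 1 bit, while feature 1 carries no class information and is dropped. A classifier such as those listed in the abstract would then be trained only on the retained columns.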
Files in This Item
There are no files associated with this item.
Appears in
Collections
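College of IT Convergence > Department of Computer Engineering > 1. Journal Articles
College of BioNano Technology > Department of Food Science and Biotechnology > 1. Journal Articles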

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Yoon, You Rim
College of IT Convergence (School of Computer Engineering, Computer Engineering Major)