Transcriptomics in Toxicogenomics, Part II: Preprocessing and Differential Expression Analysis for High Quality Dataopen access
- Authors
- Federico, Antonio; Serra, Angela; Ha, My Kieu; Kohonen, Pekka; Choi, Jang-Sik; Liampa, Irene; Nymark, Penny; Sanabria, Natasha; Cattelani, Luca; Fratello, Michele; Kinaret, Pia Anneli Sofia; Jagiello, Karolina; Puzyn, Tomasz; Melagraki, Georgia; Gulumian, Mary; Afantitis, Antreas; Sarimveis, Haralambos; Yoon, Tae-Hyun; Grafstrom, Roland; Greco, Dario
- Issue Date
- May-2020
- Publisher
- MDPI
- Keywords
- toxicogenomics; transcriptomics; RNA-Seq; scRNA-Seq; microarray; data preprocessing; quality check; normalization; batch effect; differential expression
- Citation
- NANOMATERIALS, v.10, no.5, pp.1 - 21
- Indexed
- SCIE
SCOPUS
- Journal Title
- NANOMATERIALS
- Volume
- 10
- Number
- 5
- Start Page
- 1
- End Page
- 21
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/145753
- DOI
- 10.3390/nano10050903
- ISSN
- 2079-4991
- Abstract
- Preprocessing of transcriptomics data plays a pivotal role in the development of toxicogenomics-driven tools for chemical toxicity assessment. The generation and exploitation of large volumes of molecular profiles, following an appropriate experimental design, allows the employment of toxicogenomics (TGx) approaches for a thorough characterisation of the mechanism of action (MOA) of different compounds. To date, a plethora of data preprocessing methodologies have been suggested. However, in most cases, building the optimal analytical workflow is not straightforward. A careful selection of the right tools must be carried out, since it will affect the downstream analyses and modelling approaches. Transcriptomics data preprocessing spans across multiple steps such as quality check, filtering, normalization, batch effect detection and correction. Currently, there is a lack of standard guidelines for data preprocessing in the TGx field. Defining the optimal tools and procedures to be employed in the transcriptomics data preprocessing will lead to the generation of homogeneous and unbiased data, allowing the development of more reliable, robust and accurate predictive models. In this review, we outline methods for the preprocessing of three main transcriptomic technologies including microarray, bulk RNA-Sequencing (RNA-Seq), and single cell RNA-Sequencing (scRNA-Seq). Moreover, we discuss the most common methods for the identification of differentially expressed genes and to perform a functional enrichment analysis. This review is the second part of a three-article series on Transcriptomics in Toxicogenomics.
- Files in This Item
-
- Appears in
Collections - 서울 자연과학대학 > 서울 화학과 > 1. Journal Articles
Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.