Title :
Cardiovascular Genomics: A Biomarker Identification Pipeline
Author :
Phan, J.H. ; Quo, C.F. ; Wang, M.D.
Author_Institution :
Dept. of Biomed. Eng., Georgia Inst. of Technol. & Emory Univ., Atlanta, GA, USA
Abstract :
Genomic biomarkers are essential for understanding the underlying molecular basis of human diseases such as cardiovascular disease. In this review, we describe a biomarker identification pipeline for cardiovascular disease, which includes 1) high-throughput genomic data acquisition, 2) preprocessing and normalization of data, 3) exploratory analysis, 4) feature selection, 5) classification, and 6) interpretation and validation of candidate biomarkers. We review each step in the pipeline, presenting current and widely used bioinformatics methods. Furthermore, we analyze several publicly available cardiovascular genomics datasets to illustrate the pipeline. Finally, we summarize the current challenges and opportunities for further research.
Keywords :
bioinformatics; cardiovascular system; data acquisition; data analysis; diseases; genomics; medical computing; molecular biophysics; bioinformatics; biomarker identification pipeline; biomarker validation; cardiovascular disease; cardiovascular genomics; cardiovascular genomics datasets; classification; data normalization; data preprocessing; exploratory analysis; feature selection; genomic biomarkers; high-throughput genomic data acquisition; human diseases; molecular basis; Bioinformatics; Diseases; Gene expression; Genomics; Pipelines; Probes; Biomarker identification; bioinformatics pipeline; cardiovascular disease (CVD) risk; classification; genomic microarrays; next-generation sequencing (NGS); Biological Markers; Cardiovascular Diseases; Cluster Analysis; Gene Expression Profiling; Genomics; Humans; Oligonucleotide Array Sequence Analysis;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2012.2199570