Title :
An Extended Markov Blanket Approach to Proteomic Biomarker Detection From High-Resolution Mass Spectrometry Data
Author :
Oh, Jung Hun ; Gurnani, Prem ; Schorge, John ; Rosenblatt, Kevin P. ; Gao, Jean X.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Texas, Arlington, TX
fDate :
3/1/2009 12:00:00 AM
Abstract :
High-resolution matrix-assisted laser desorption/ionization time-of-flight mass spectrometry has recently shown promise as a screening tool for detecting discriminatory peptide/protein patterns. The major computational obstacle in finding such patterns is the large number of mass/charge peaks (features, biomarkers, data points) in a spectrum. To tackle this problem, we have developed methods for data preprocessing and biomarker selection. The preprocessing consists of binning, baseline correction, and normalization. An algorithm, extended Markov blanket, is developed for biomarker detection, which combines redundant feature removal and discriminant feature selection. The biomarker selection couples with support vector machine to achieve sample prediction from high-resolution proteomic profiles. Our algorithm is applied to recurrent ovarian cancer study that contains platinum-sensitive and platinum-resistant samples after treatment. Experiments show that the proposed method performs better than other feature selection algorithms. In particular, our algorithm yields good performance in terms of both sensitivity and specificity as compared to other methods.
Keywords :
Markov processes; biochemistry; biomedical measurement; cancer; data analysis; feature extraction; mass spectroscopic chemical analysis; medical diagnostic computing; photon stimulated desorption; proteins; proteomics; support vector machines; time of flight mass spectra; tumours; baseline correction; data binning; data preprocessing; discriminant feature selection; extended Markov blanket approach; high-resolution mass spectrometry; laser ionization time-of-flight mass spectrometry; matrix-assisted laser desorption; normalization; peptide-protein pattern detection; platinum-resistant samples; platinum-sensitive samples; proteomic biomarker detection; recurrent ovarian cancer; redundant feature removal; support vector machine; Biomarker; Markov blanket (MB); feature selection; matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF); ovarian cancer; preprocessing; Algorithms; Artificial Intelligence; Biological Markers; Databases, Protein; Female; Humans; Markov Chains; Normal Distribution; Ovarian Neoplasms; Protein Array Analysis; Proteomics; Sensitivity and Specificity; Spectrometry, Mass, Matrix-Assisted Laser Desorption-Ionization;
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
DOI :
10.1109/TITB.2008.2007909