Title :
Prediction of labor for pregnant women using high-resolution mass spectrometry data
Author :
Oh, Jung Hun ; Nandi, Animesh ; Gurnani, Prem ; Bryant-Greenwood, Peter ; Rosenblatt, Kevin P. ; Gao, Jean
Author_Institution :
Dept. of Comput. Sci. Eng., Texas Univ., Arlington, TX
Abstract :
High-resolution MALDI-TOF (matrix-assisted laser desorption/ionization time-of-flight) mass spectrometry has shown promise as a screening tool for detecting discriminatory protein patterns. The major computational obstacle in analyzing MALDI-TOF data is a large number of mass/charge peaks (a.k.a. features, data points). With the number of data points easily going beyond one million for a single sample, efficient feature selection is critical for unequivocal protein pattern discovery. To tackle this problem, we have developed a multi-step strategy for data preprocessing and afterwards feature selection. The preprocessing is composed of binning, baseline correction, and normalization. For the preprocessed data, we propose a new feature subset selection method that is a hybrid filter/wrapper approach. Based on the two feature subsets for each feature, high and low correlated subsets, a feature is assigned a weight which indicates the extent of feature importance. Our scheme is applied to the analysis of labor dataset to predict delivery time of pregnant women. To validate the performance of the proposed algorithm, experiments are performed in comparison with other feature selection and classification methods. We show that our proposed approach outperforms other algorithms
Keywords :
biochemistry; biomedical measurement; feature extraction; laser applications in medicine; mass spectroscopic chemical analysis; medical computing; molecular biophysics; obstetrics; pattern classification; photoionisation; photon stimulated desorption; proteins; time of flight mass spectroscopy; classification methods; data preprocessing; feature subset selection method; high-resolution time-of-flight mass spectrometry data; hybrid filter; hybrid wrapper approach; labor prediction; matrix-assisted laser desorption; matrix-assisted laser ionization; pregnant women; protein patterns; unequivocal protein pattern discovery; Biomarkers; Cancer; Data analysis; Diseases; Ionization; Mass spectroscopy; Pathology; Pregnancy; Proteins; Proteomics;
Conference_Titel :
BioInformatics and BioEngineering, 2006. BIBE 2006. Sixth IEEE Symposium on
Conference_Location :
Arlington, VA
Print_ISBN :
0-7695-2727-2
DOI :
10.1109/BIBE.2006.253298