Title :
Statistical Analysis of Mascot Peptide Identification with Active Logistic Regression
Author :
Shi, Jinhong ; Lin, Wenjun ; Wu, Fang-Xiang
Author_Institution :
Div. of Biomed. Eng., Univ. of Saskatchewan, Saskatoon, SK, Canada
Abstract :
We apply active learning and logistic regression to perform statistical analysis of Mascot peptide identification.Uncertainty sampling is used to select examples for labeling, and selected examples are labeled with reference data as the oracle. In each iteration of active learning, the penalized Newton-Raphson method is used to solve the logistic regression model. By testing the method on two datasets with known validity, the results have demonstrated that the proposed method can assign accurate probabilities to Mascot peptide identifications and have a high discrimination power to separate correct and incorrect peptide identifications. By use of active learning, superior classifiers have been achieved with a significantly reduced training dataset.
Keywords :
Newton-Raphson method; bioinformatics; learning (artificial intelligence); molecular biophysics; pattern classification; regression analysis; sampling methods; Mascot peptide identification; active learning; active logistic regression; classifiers; penalized Newton-Raphson method; statistical analysis; uncertainty sampling; Labeling; Logistics; Newton method; Peptides; Probability; Proteins; Proteomics; Sampling methods; Statistical analysis; Uncertainty;
Conference_Titel :
Bioinformatics and Biomedical Engineering (iCBBE), 2010 4th International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-4712-1
Electronic_ISBN :
2151-7614
DOI :
10.1109/ICBBE.2010.5516290