Title :
Competitive and self-contained gene set analysis methods applied for class prediction
Author :
Maciejewski, Henryk
Author_Institution :
Inst. of Comput. Eng., Control & Robot., Wroclaw Univ. of Technol., Wrocław, Poland
Abstract :
This paper compares two methodologically different approaches to gene set analysis applied for selection of features for sample classification based on microarray studies. We analyze competitive and self-contained methods in terms of predictive performance of features generated from most differentially expressed gene sets (pathways) identified with these approaches. We also observe stability of features returned. We use the features to train several classifiers (e.g., SVM, random forest, nearest shrunken centroids, etc.) We generally observe smaller classification errors and better stability of features produced by the self-contained algorithm. This comparative study is based on the leukemia data set published in [3].
Keywords :
bioinformatics; genetics; pattern classification; support vector machines; SVM; class prediction; leukemia data set; microarray study; nearest shrunken centroids; predictive performance; random forest; sample classification; self-contained gene set analysis method; Algorithm design and analysis; Logistics; Prediction algorithms; Radiation detectors; Stability analysis; Support vector machines; Training data;
Conference_Titel :
Computer Science and Information Systems (FedCSIS), 2011 Federated Conference on
Conference_Location :
Szczecin
Print_ISBN :
978-1-4577-0041-5
Electronic_ISBN :
978-83-60810-35-4