DocumentCode :
3501328
Title :
Feature selection of pathway markers for microarray-based disease classification using negatively correlated feature sets
Author :
Chan, Jonathan H. ; Sootanan, Pitak ; Larpeampaisarl, Ponlavit
Author_Institution :
Sch. of Inf. Technol., King Mongkut´´s Univ. of Technol. Thonburi, Bangkok, Thailand
fYear :
2011
fDate :
July 31 2011-Aug. 5 2011
Firstpage :
3293
Lastpage :
3299
Abstract :
Microarray-based classification of disease states is based on gene expression profiles of subjects. Various methods have been proposed to identify diagnostic markers that can accurately discriminate between two classes such as case and control. Many of the methods that used only a subset of ranked genes in the pathway may not be able to fully represent the classification boundaries for the two disease classes. The use of negatively correlated feature sets (NCFS) for identifying phenotype-correlated genes (PCOGs) and inferring pathway activities is used here. The NCFS-based pathway activity inference schemes significantly improved the power of pathway markers to discriminate between normal and cancer, as well as relapse and non-relapse, classes in microarray expression datasets of breast cancer. Furthermore, the use of ranker feature selection methods with top 3 pathway markers has been shown to be suitable for both logistic and NB classifiers. In addition, the proposed single pathway classification (SPC) ranker provided similar performance to the traditional SVM and Relief-F feature selection methods. The identification of PCOGs within each pathway, especially with the use of NCFS based on correlation with ideal markers (NCFS-i), helps to minimize the effect of potentially noisy experimental data, leading to accurate and robust classification results.
Keywords :
cancer; genetics; medical computing; pattern classification; NCFS-based pathway activity inference scheme; breast cancer; diagnostic marker; gene expression profile; microarray expression dataset; microarray-based disease classification; negatively correlated feature set; pathway marker; phenotype-correlated genes; ranker feature selection; single pathway classification; Accuracy; Breast cancer; Diseases; Gene expression; Logistics; Metastasis; Niobium;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), The 2011 International Joint Conference on
Conference_Location :
San Jose, CA
ISSN :
2161-4393
Print_ISBN :
978-1-4244-9635-8
Type :
conf
DOI :
10.1109/IJCNN.2011.6033658
Filename :
6033658
Link To Document :
بازگشت