Title :
Multicategory cancer classification from gene expression data by multiclass NPPC ensemble
Author :
Ghorai, Santanu ; Mukherjee, Anirban ; Sengupta, Sanghamitra ; Dutta, Pranab K.
Author_Institution :
Dept. of ECE, MCKV Inst. of Eng., Howrah, India
Abstract :
The discovery of DNA microarray technologies have given immense opportunity to make gene expression profiles for different cancer types. Besides binary classification such as normal versus tumor samples the discrimination of multiple tumor types is also important. In this work, we have first extended the recently developed binary nonparallel plane proximal classifier (NPPC) to multiclass NPPC by decomposition techniques. The multiclass NPPC is then used in a computer aided diagnosis framework to classify multicategory cancer from gene expression data by selecting very few genes by using mutual information criterion. The idea of binary NPPC ensemble is extended to form multiclass NPPC ensemble. Besides usual majority voting method, we have introduced minimum average proximity based decision combiner for multiclass NPPC ensemble. The effectiveness of the proposed method are demonstrated on four benchmark microarray data sets and compared with support vector machine (SVM) classifier in a similar framework.
Keywords :
bioinformatics; cancer; genetics; lab-on-a-chip; support vector machines; tumours; DNA microarray; computer aided diagnosis; decomposition technique; gene expression; majority voting method; minimum average proximity; multicategory cancer classification; multiclass NPPC ensemble; nonparallel plane proximal classifier; support vector machine; tumor; Computers; Extraterrestrial measurements; Kernel; Support vector machine classification; Vectors; cancer classification; classifier ensemble; microarray data analysis; proximal classifier;
Conference_Titel :
Systems in Medicine and Biology (ICSMB), 2010 International Conference on
Conference_Location :
Kharagpur
Print_ISBN :
978-1-61284-039-0
DOI :
10.1109/ICSMB.2010.5735343