Title :
Microarray data analysis for cancer classification
Author :
Osareh, Alireza ; Shadgar, Bita
Author_Institution :
Comput. Eng. Dept., Islamic Azad Univ., Dezful, Iran
Abstract :
Cancer diagnosis is one of the most important emerging clinical applications of gene expression microarray data. In this work, we aim to develop an automated system for robust and reliable cancer diagnoses based on gene microarray data. Support vector machine classifiers outperform other popular classifiers, such as K nearest neighbours, naive Bayes, neural networks and decision tree, often to a remarkable degree. We choose a set of 9 publicly available benchmark microarray datasets that encompass both binary and multi-class cancer problems. Results of comparative studies are provided, demonstrating that effective feature selection is essential to the development of classifiers intended for use in gene-based cancer classification. In particular, amongst various systematic experiments carried out, best classification model is achieved using a subset of features chosen via information gain feature ranking for support vector machine classifier.
Keywords :
biological techniques; biomedical measurement; cancer; genetics; medical computing; molecular biophysics; patient diagnosis; pattern classification; support vector machines; automated system; benchmark microarray datasets; gene based cancer classification; gene expression microarray data; microarray data analysis; reliable cancer diagnoses; robust cancer diagnoses; support vector machine classifiers; Cancer; DNA; Data analysis; Data engineering; Data mining; Diseases; Filters; Gene expression; Support vector machine classification; Support vector machines; DN; Gene classification; Gene selection; K nearest neighbours; Support vector machines; component; formatting;
Conference_Titel :
Health Informatics and Bioinformatics (HIBIT), 2010 5th International Symposium on
Conference_Location :
Antalya
Print_ISBN :
978-1-4244-5968-1
DOI :
10.1109/HIBIT.2010.5478893