Title :
An Efficient Approach for Classification of Gene Expression Microarray Data
Author :
Sreepada, Rama Syamala ; Vipsita, Swati ; Mohapatra, Puspanjali
Author_Institution :
Dept. of Comput. Sc. Eng., IIIT Bhubaneswar, Bhubaneswar, India
Abstract :
Microarrays help in storing gene expression data from a cell. Each microarray describes features of each cell. The rows in microarray represent the samples and the columns represent the gene expression level of the cell. Microarray data is of high dimension due to which classification using conventional methods becomes tedious and inefficient. Therefore, reducing the dimension of long feature vector and extracting relevant features out of it becomes a very challenging task. This can be achieved using various techniques of feature extraction and/or feature selection. Design of an efficient classification model is another crucial task for any classification problem. In this paper, emphasis is given for significant feature extraction as well as efficient design of classifier. The task of microarray classification is done in two phases. In the first phase, a hybrid approach of Genetic Algorithm (GA) and Principal Component Analysis (PCA) is used for extracting relevant features. In the second phase, Probabilistic Neural Network (PNN) is used as the classifier and GA is implemented to optimize the topology of the PNN. The datasets used in the experiment are Colon Tumor, Diffuse Large B-Cell Lymphoma (DLBCL) and Leukaemia (ALL and AML). The proposed technique gave efficient results for the datasets used.
Keywords :
cancer; feature extraction; feature selection; genetic algorithms; genetics; lab-on-a-chip; medical computing; neural nets; pattern classification; principal component analysis; probability; topology; tumours; ALL; AML; DLBCL; GA; PCA; PNN topology; colon tumor; diffuse large B-cell lymphoma; feature extraction; feature selection; feature vector; gene expression data storing; gene expression microarray data classification approach; genetic algorithm; hybrid approach; leukemia; principal component analysis; probabilistic neural network; Accuracy; Information technology; Feature extraction; Genetic Algorithms; Microarray; Probabilistic Neural Network; feature selection;
Conference_Titel :
Emerging Applications of Information Technology (EAIT), 2014 Fourth International Conference of
DOI :
10.1109/EAIT.2014.46