Title :
Parallel diffrential evolution clustering algorithm based on MapReduce
Author :
Daoudi, Meroua ; Hamena, Soumiya ; Benmounah, Zakaria ; Batouche, Mohamed
Author_Institution :
Dept. of Comput. Sci., Constantine 2 Univ., Constantine, Algeria
Abstract :
Cancer research is a challenging and competitive field. The study of gene expression data has enabled the discovery of unknown types of cancer using unsupervised learning. However, genomic sequence data are increasing in an exponential manner. Indeed, since 2011 the global annual sequencing capacity is estimated to be quadrillions of bases and counting. To cope with this issue, we propose, in this paper, the implementation of differential evolution clustering algorithm using MapReduce methodology in order to deal with big data. The proposed algorithm consists in three consecutive levels. Experiments were conducted on 18 real gene expression data sets. The obtained results have shown that our approach is effective and competes with existing algorithms.
Keywords :
Big Data; cancer; evolutionary computation; genomics; medical computing; parallel algorithms; pattern clustering; unsupervised learning; MapReduce methodology; big data; cancer research; differential evolution clustering algorithm; gene expression data; genomic sequence data; parallel differential evolution clustering algorithm; unsupervised learning; Algorithm design and analysis; Clustering algorithms; Evolution (biology); Gene expression; Partitioning algorithms; Vectors; Data Clustering; Diffrential Evolution; Gene Expression Analysis; MapReduce; Parallel Processing;
Conference_Titel :
Soft Computing and Pattern Recognition (SoCPaR), 2014 6th International Conference of
Conference_Location :
Tunis
DOI :
10.1109/SOCPAR.2014.7008029