Title :
Gene selection using biological knowledge and fuzzy clustering
Author :
Ghosh, Sampreeti ; Mitra, Sushmita
Author_Institution :
Machine Intell. Unit, Indian Stat. Inst., India
Abstract :
Gene expression data being high-dimensional and redundant, dimensionality reduction is of prime concern. We employ the algorithm Fuzzy Clustering Large Applications based on RAN-domized Search (FCLARANS) for attribute clustering and dimensionality reduction based on the study of gene ontology and differential gene expressions. The use of domain knowledge helps in the automated selection of biologically meaningful partitions. The use of Gene Ontology (GO) study helps in detecting biologically enriched and statistically significant clusters. Fold-change is measured to select the differentially expressed genes as the representatives of these clusters. Tools like Eisen plot and cluster profiles of these clusters help establish their coherence. Important representative features (or genes) are extracted from each enriched gene partitions to form the reduced gene space. While the reduced gene set forms a biologically meaningful gene space it simultaneously leads to a decrease in computational burden. External validation of the reduced subspace, using various well-known classifiers, establishes the effectiveness of the proposed methodology.
Keywords :
bioinformatics; feature extraction; fuzzy set theory; ontologies (artificial intelligence); pattern classification; pattern clustering; Eisen plot; FCLARANS; GO; attribute clustering; biological knowledge; biologically meaningful partitions; computational burden; differential gene expressions; dimensionality reduction; domain knowledge; fuzzy clustering; fuzzy clustering large applications based on RAN-domized search; gene expression data; gene ontology; gene selection; gene space reduction; representative features extraction; statistically significant clusters; Cancer; Clustering algorithms; Colon; Gene expression; Ontologies; Uncertainty;
Conference_Titel :
Fuzzy Systems (FUZZ-IEEE), 2012 IEEE International Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1507-4
Electronic_ISBN :
1098-7584
DOI :
10.1109/FUZZ-IEEE.2012.6250797