Title :
Reduct Generation and Classification of Gene Expression Data
Author :
Momin, Bashirahamad F. ; Mitra, Sushmita ; Gupta, Rana Datta
Author_Institution :
Dept. of Comput. Sci. & Eng., Walchand Coll. of Eng., Vishrambag
Abstract :
Identification of gene subsets responsible for discerning between available samples of gene microarray data is an important task in bioinformatics. Due to the large number of genes in samples, there is an exponentially large search space of solutions. The main challenge is to reduce or remove the redundant genes, without affecting discernibility between objects. Reducts, from rough set theory, correspond to a minimal subset of essential genes. We present an algorithm for generating reducts from gene microarray data. It proceeds by preprocessing gene expression data, discretization of real value attributes into categorical followed by positive region based approach for reduct generation. For comparison, different approaches for reduct generation have also been discussed. Results on benchmark gene expression datasets demonstrate more than 90% reduction of redundant genes
Keywords :
biology computing; pattern classification; rough set theory; bioinformatics; gene expression data; gene microarray data; gene subsets; gene subsets identification; rough set theory; Bioinformatics; Computer science; Data engineering; Educational institutions; Gene expression; Information systems; Machine intelligence; Rough sets; Set theory; Uncertainty; Bioinformatics; Rough sets; classification.; microarray data; reduct; soft computing;
Conference_Titel :
Hybrid Information Technology, 2006. ICHIT '06. International Conference on
Conference_Location :
Cheju Island
Print_ISBN :
0-7695-2674-8
DOI :
10.1109/ICHIT.2006.253568