DocumentCode :
3085789
Title :
Minimum number of genes for microarray feature selection
Author :
Baralis, Elena ; Bruno, Giulia ; Fiori, Alessandro
Author_Institution :
Politecnico di Torino, Italy
fYear :
2008
fDate :
20-25 Aug. 2008
Firstpage :
5692
Lastpage :
5695
Abstract :
A fundamental problem in microarray analysis is to identify relevant genes from large amounts of expression data. Feature selection aims at identifying a subset of features for building robust learning models. However, finding the optimal number of features is a challenging problem, as it is a trade off between information loss when pruning excessively and noise increase when pruning is too weak. This paper presents a novel representation of genes as strings of bits and a method which automatically selects the minimum number of genes to reach a good classification accuracy on the training set. Our method first eliminates redundant features, which do not add further information for classification, then it exploits a set covering algorithm. Preliminary experimental results on public datasets confirm the intuition of the proposed method leading to high classification accuracy.
Keywords :
Biology computing; Cancer; Computational efficiency; DNA; Diseases; Filters; Gene expression; Noise robustness; Testing; Time measurement; Algorithms; Artificial Intelligence; Diagnosis, Computer-Assisted; Gene Expression Profiling; Humans; Oligonucleotide Array Sequence Analysis; Pattern Recognition, Automated; Reproducibility of Results; Sample Size; Sensitivity and Specificity; Signal Processing, Computer-Assisted;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Engineering in Medicine and Biology Society, 2008. EMBS 2008. 30th Annual International Conference of the IEEE
Conference_Location :
Vancouver, BC
ISSN :
1557-170X
Print_ISBN :
978-1-4244-1814-5
Electronic_ISBN :
1557-170X
Type :
conf
DOI :
10.1109/IEMBS.2008.4650506
Filename :
4650506
Link To Document :
بازگشت