Title :
A Mathematical Model for the Validation of Gene Selection Methods
Author :
Muselli, Marco ; Bertoni, Alberto ; Frasca, Marco ; Beghini, Alessandro ; Ruffino, Francesca ; Valentini, Giorgio
Author_Institution :
Ist. di Elettron., di Ing. dell´´Informa- zione e delle Telecomun., Consiglio Naz. delle Ric., Geneva, Italy
Abstract :
Gene selection methods aim at determining biologically relevant subsets of genes in DNA microarray experiments. However, their assessment and validation represent a major difficulty since the subset of biologically relevant genes is usually unknown. To solve this problem a novel procedure for generating biologically plausible synthetic gene expression data is proposed. It is based on a proper mathematical model representing gene expression signatures and expression profiles through Boolean threshold functions. The results show that the proposed procedure can be successfully adopted to analyze the quality of statistical and machine learning-based gene selection algorithms.
Keywords :
Boolean functions; DNA; biology computing; genetics; learning (artificial intelligence); molecular biophysics; physiological models; statistical analysis; Boolean threshold functions; DNA microarray; biologically relevant subsets; gene expression profiles; gene expression signatures; gene selection methods; machine learning; mathematical model; statistical algorithms; Bioinformatics; Biological system modeling; Boolean functions; Data models; Gene expression; Mathematical model; Boolean functions.; Gene selection; feature selection; gene expression; mathematical models; Algorithms; Computational Biology; Computer Simulation; Databases, Factual; Gene Expression Profiling; Humans; Models, Genetic; Neoplasms; Oligonucleotide Array Sequence Analysis; Reproducibility of Results;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2010.83