Title :
Probabilistic principal surfaces for yeast gene microarray data mining
Author :
Staiano, Antonino ; De Vinco, Lara ; Ciaramella, Angelo ; Raiconi, Giancarlo ; Tagliaferri, Roberto ; Amato, Roberto ; Longo, Giuseppe ; Donalek, Ciro ; Miele, Gennaro ; Di Bernardo, Diego
Author_Institution :
Dipt. di Matematica ed Informatica, Univ. di Salerno, Italy
Abstract :
The recent technological advances are producing huge data sets in almost all fields of scientific research, from astronomy to genetics. Although each research field often requires ad-hoc, fine tuned, procedures to properly exploit all the available information inherently present in the data, there is an urgent need for a new generation of general computational theories and tools capable to boost most human activities of data analysis. Here, we propose probabilistic principal surfaces (PPS) as an effective high-D data visualization and clustering tool for data mining applications, emphasizing its flexibility and generality of use in data-rich field. In order to better illustrate the potentialities of the method, we also provide a real world case-study by discussing the use of PPS for the analysis of yeast gene expression levels from microarray chips.
Keywords :
biology computing; data analysis; data mining; data visualisation; genetics; pattern clustering; probability; astronomy; clustering tool; data analysis; data mining; data visualization; general computational theories; general computational tools; genetics; microarray chip; probabilistic principal surfaces; yeast gene microarray; Astronomy; Data analysis; Data mining; Data visualization; Fungi; Gene expression; Genetics; Humans; Space technology; Topology;
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
DOI :
10.1109/ICDM.2004.10088