DocumentCode :
393970
Title :
A clustering algorithm for gene expression data using wavelet packet decomposition
Author :
Rao, Arvind
Author_Institution :
Dept. of Electron. & Comput. Eng., Texas Univ., Austin, TX, USA
Volume :
1
fYear :
2002
fDate :
3-6 Nov. 2002
Firstpage :
316
Abstract :
Mining large amounts of data and deriving meaning from the mined data in bioinformatics is a computationally intensive and relevant issue. In this paper, an efficient algorithm to cluster genes into similar ´functional´ groups is presented. This is a technique for extracting and characterizing rhythmic expression profiles from genome-wide DNA micro-array hybridization data. These patterns are clues to discovering rhythmic genes implicated in cell-cycle, circadian, or other biological processes. These functionalities are discussed. A signal-processing approach to this problem is presented. The information theoretic criterion for identifying those genes exhibiting maximum variation in behavior is explored. The genes are clustered and then relationships are derived for the proposition of a temporal cell-cycle model governing regulatory behavior. The human fibroblast and yeast data set are presently considered for analysis.
Keywords :
DNA; biology computing; data mining; genetics; pattern clustering; signal processing; wavelet transforms; bioinformatics; cell-cycle; circadian process; clustering algorithm; entropy; gene expression data; genome-wide DNA micro-array hybridization data; human fibroblast; information theoretic criterion; rhythmic expression profiles; signal processing approach; temporal cell-cycle model; wavelet packet decomposition; yeast data set; Bioinformatics; Biological processes; Biological system modeling; Clustering algorithms; DNA; Data mining; Gene expression; Genomics; Humans; Wavelet packets;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signals, Systems and Computers, 2002. Conference Record of the Thirty-Sixth Asilomar Conference on
Conference_Location :
Pacific Grove, CA, USA
ISSN :
1058-6393
Print_ISBN :
0-7803-7576-9
Type :
conf
DOI :
10.1109/ACSSC.2002.1197198
Filename :
1197198
Link To Document :
بازگشت