DocumentCode :
478695
Title :
A novel gene-centric clustering algorithm for standardization of time series expression data
Author :
Tsiporkova, Elena ; Boeva, Veselka
Author_Institution :
House of Econ., Innovation Center - East Flanders, Ghent
Volume :
2
fYear :
2008
fDate :
6-8 Sept. 2008
Firstpage :
42712
Lastpage :
42717
Abstract :
This paper proposes a novel data transformation method aiming at multi-purpose data standardization and inspired by gene-centric clustering approaches. The idea is to perform data standardization via template matching of each expression profile with the rest of the expression profiles employing dynamic time warping (DTW) alignment algorithm to measure the similarity between the expression profiles. This algorithm facilitates the identification of a cluster of genes whose expression profiles are related, possibly with a nonlinear time shift, to the profile of the gene supplied as a template. Consequently, for each gene profile a varying number (based on the degree of similarity) of neighboring gene profiles is identified to be used in the subsequent standardization phase. The latter uses a recursive aggregation algorithm in order to reduce the set of neighboring expression profiles into a singe profile representing the standardized version of the profile in question. The proposed data transformation method is evaluated and demonstrated on gene expression time series data coming from a study examining the global cell-cycle control of gene expression in fission yeast Schizosaccharomyces pombe.
Keywords :
genetics; medical computing; pattern clustering; pattern matching; time series; data standardization; data transformation method; dynamic time warping alignment algorithm; gene-centric clustering algorithm; recursive aggregation algorithm; template matching; time series expression data; yeast Schizosaccharomyces pombe; Bioinformatics; Clustering algorithms; Fungi; Gene expression; Genomics; Intelligent systems; Performance evaluation; Sampling methods; Standardization; Time measurement; DTW Distance; Data Standardization; RRN Algorithm; Time Series Data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems, 2008. IS '08. 4th International IEEE Conference
Conference_Location :
Varna
Print_ISBN :
978-1-4244-1739-1
Electronic_ISBN :
978-1-4244-1740-7
Type :
conf
DOI :
10.1109/IS.2008.4670512
Filename :
4670512
Link To Document :
بازگشت