DocumentCode
478695
Title
A novel gene-centric clustering algorithm for standardization of time series expression data
Author
Tsiporkova, Elena ; Boeva, Veselka
Author_Institution
House of Econ., Innovation Center - East Flanders, Ghent
Volume
2
fYear
2008
fDate
6-8 Sept. 2008
Firstpage
42712
Lastpage
42717
Abstract
This paper proposes a novel data transformation method aiming at multi-purpose data standardization and inspired by gene-centric clustering approaches. The idea is to perform data standardization via template matching of each expression profile with the rest of the expression profiles employing dynamic time warping (DTW) alignment algorithm to measure the similarity between the expression profiles. This algorithm facilitates the identification of a cluster of genes whose expression profiles are related, possibly with a nonlinear time shift, to the profile of the gene supplied as a template. Consequently, for each gene profile a varying number (based on the degree of similarity) of neighboring gene profiles is identified to be used in the subsequent standardization phase. The latter uses a recursive aggregation algorithm in order to reduce the set of neighboring expression profiles into a singe profile representing the standardized version of the profile in question. The proposed data transformation method is evaluated and demonstrated on gene expression time series data coming from a study examining the global cell-cycle control of gene expression in fission yeast Schizosaccharomyces pombe.
Keywords
genetics; medical computing; pattern clustering; pattern matching; time series; data standardization; data transformation method; dynamic time warping alignment algorithm; gene-centric clustering algorithm; recursive aggregation algorithm; template matching; time series expression data; yeast Schizosaccharomyces pombe; Bioinformatics; Clustering algorithms; Fungi; Gene expression; Genomics; Intelligent systems; Performance evaluation; Sampling methods; Standardization; Time measurement; DTW Distance; Data Standardization; RRN Algorithm; Time Series Data;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Systems, 2008. IS '08. 4th International IEEE Conference
Conference_Location
Varna
Print_ISBN
978-1-4244-1739-1
Electronic_ISBN
978-1-4244-1740-7
Type
conf
DOI
10.1109/IS.2008.4670512
Filename
4670512
Link To Document