• DocumentCode
    478695
  • Title

    A novel gene-centric clustering algorithm for standardization of time series expression data

  • Author

    Tsiporkova, Elena ; Boeva, Veselka

  • Author_Institution
    House of Econ., Innovation Center - East Flanders, Ghent
  • Volume
    2
  • fYear
    2008
  • fDate
    6-8 Sept. 2008
  • Firstpage
    42712
  • Lastpage
    42717
  • Abstract
    This paper proposes a novel data transformation method aiming at multi-purpose data standardization and inspired by gene-centric clustering approaches. The idea is to perform data standardization via template matching of each expression profile with the rest of the expression profiles employing dynamic time warping (DTW) alignment algorithm to measure the similarity between the expression profiles. This algorithm facilitates the identification of a cluster of genes whose expression profiles are related, possibly with a nonlinear time shift, to the profile of the gene supplied as a template. Consequently, for each gene profile a varying number (based on the degree of similarity) of neighboring gene profiles is identified to be used in the subsequent standardization phase. The latter uses a recursive aggregation algorithm in order to reduce the set of neighboring expression profiles into a singe profile representing the standardized version of the profile in question. The proposed data transformation method is evaluated and demonstrated on gene expression time series data coming from a study examining the global cell-cycle control of gene expression in fission yeast Schizosaccharomyces pombe.
  • Keywords
    genetics; medical computing; pattern clustering; pattern matching; time series; data standardization; data transformation method; dynamic time warping alignment algorithm; gene-centric clustering algorithm; recursive aggregation algorithm; template matching; time series expression data; yeast Schizosaccharomyces pombe; Bioinformatics; Clustering algorithms; Fungi; Gene expression; Genomics; Intelligent systems; Performance evaluation; Sampling methods; Standardization; Time measurement; DTW Distance; Data Standardization; RRN Algorithm; Time Series Data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Systems, 2008. IS '08. 4th International IEEE Conference
  • Conference_Location
    Varna
  • Print_ISBN
    978-1-4244-1739-1
  • Electronic_ISBN
    978-1-4244-1740-7
  • Type

    conf

  • DOI
    10.1109/IS.2008.4670512
  • Filename
    4670512