• DocumentCode
    593917
  • Title

    A Hierarchical Approach for Clustering and Pattern Matching of Gene Expression Data

  • Author

    Hoque, Sanyat ; Istyaq, S. ; Riaz, M.M.

  • Author_Institution
    Electr. Eng. Sect., Aligarh Muslim Univ., Aligarh, India
  • fYear
    2012
  • fDate
    25-28 Aug. 2012
  • Firstpage
    413
  • Lastpage
    416
  • Abstract
    Clustering data is a well-known and challenging problem that has been widely studied in data base analysis. This paper shows how it made possible in genetic engineering to observe simultaneously the expression levels of huge genes during important genetic processes and identifies similar pattern genes and maximum matching regulation genes. This paper presents simple and static method to find regulation of gene from statistical gene expression data, comparison of regulation of genes and then form clusters with genes having similar regulation. Also extracts sub clusters from big cluster and perform hierarchical analysis. Finding coherent patterns or means from sub-clusters and by comparing sub cluster genes with coherent pattern it refines clusters and form fine clusters. This is a simple gradient base technique of finding regulation, hence it easily removes noisy genes. This technique relates bottom up approach. Also this process relates Density-Based clustering approach, hence can be done using density estimation. This scheme can extract clusters efficiently with reduced number of comparisons. Extracting the patterns from large genetic data enhances identification of similar characteristic genes and variety of hidden information about genes. Grouping similar data in large dimension spaces is to hidden pattern or meaningful subgroups have many applications in biological and other fields.
  • Keywords
    biology computing; genetic engineering; genetics; pattern matching; statistical analysis; clustering data; data base analysis; density based clustering; density estimation; genetic data; genetic engineering; genetic process; gradient base technique; hierarchical analysis; maximum matching regulation genes; pattern matching; statistical gene expression data; Clustering algorithms; DNA; Data mining; Educational institutions; Gene expression; Pattern matching; Clustering; density based; gene expression; gradient; hierarchical; microarray; regulation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Genetic and Evolutionary Computing (ICGEC), 2012 Sixth International Conference on
  • Conference_Location
    Kitakushu
  • Print_ISBN
    978-1-4673-2138-9
  • Type

    conf

  • DOI
    10.1109/ICGEC.2012.16
  • Filename
    6457131