DocumentCode
593917
Title
A Hierarchical Approach for Clustering and Pattern Matching of Gene Expression Data
Author
Hoque, Sanyat ; Istyaq, S. ; Riaz, M.M.
Author_Institution
Electr. Eng. Sect., Aligarh Muslim Univ., Aligarh, India
fYear
2012
fDate
25-28 Aug. 2012
Firstpage
413
Lastpage
416
Abstract
Clustering data is a well-known and challenging problem that has been widely studied in data base analysis. This paper shows how it made possible in genetic engineering to observe simultaneously the expression levels of huge genes during important genetic processes and identifies similar pattern genes and maximum matching regulation genes. This paper presents simple and static method to find regulation of gene from statistical gene expression data, comparison of regulation of genes and then form clusters with genes having similar regulation. Also extracts sub clusters from big cluster and perform hierarchical analysis. Finding coherent patterns or means from sub-clusters and by comparing sub cluster genes with coherent pattern it refines clusters and form fine clusters. This is a simple gradient base technique of finding regulation, hence it easily removes noisy genes. This technique relates bottom up approach. Also this process relates Density-Based clustering approach, hence can be done using density estimation. This scheme can extract clusters efficiently with reduced number of comparisons. Extracting the patterns from large genetic data enhances identification of similar characteristic genes and variety of hidden information about genes. Grouping similar data in large dimension spaces is to hidden pattern or meaningful subgroups have many applications in biological and other fields.
Keywords
biology computing; genetic engineering; genetics; pattern matching; statistical analysis; clustering data; data base analysis; density based clustering; density estimation; genetic data; genetic engineering; genetic process; gradient base technique; hierarchical analysis; maximum matching regulation genes; pattern matching; statistical gene expression data; Clustering algorithms; DNA; Data mining; Educational institutions; Gene expression; Pattern matching; Clustering; density based; gene expression; gradient; hierarchical; microarray; regulation;
fLanguage
English
Publisher
ieee
Conference_Titel
Genetic and Evolutionary Computing (ICGEC), 2012 Sixth International Conference on
Conference_Location
Kitakushu
Print_ISBN
978-1-4673-2138-9
Type
conf
DOI
10.1109/ICGEC.2012.16
Filename
6457131
Link To Document