Title :
Pattern Cores And Connectedness in Cancer Gene Expression
Author :
Yousri, Noha A. ; Kamel, Mohamed S. ; Ismail, Mohamed A.
Author_Institution :
Univ. of Waterloo, Waterloo
Abstract :
A single microarray experiment yields a huge number of gene expression measurements; together with the large number of tumor samples, this calls for efficient methods that can extract hidden information and structure from such data sets. Clustering is a common analysis tool for finding groups of gene expression patterns. However, analyzing large clusters can be infeasible in large data sets. In this work, a method is proposed to capture the main structure of the data by identifying core gene expressions. This reduces the data to a subset of representatives that captures the main behavior of gene expression. When integrated with clustering, the method makes it feasible to analyze large clusters and to identify the main expression patterns and the relations between them. The importance of using connected-based clustering is emphasized in order to reveal the gradual change between core gene expressions, which cannot be achieved using traditional clustering algorithms. The analysis is performed on breast cancer data to illustrate the significance of the proposed methodology.
Keywords :
cancer; genetics; pattern clustering; tumours; breast cancer; connected-based clustering; gene expressions; pattern cores; tumor; Algorithm design and analysis; Clustering algorithms; Data analysis; Data engineering; Data mining; Gene expression; Neoplasms; Pattern analysis; Shape; clustering; connected patterns; density-based cores
Conference_Title :
Bioinformatics and Bioengineering, 2007. BIBE 2007. Proceedings of the 7th IEEE International Conference on
Conference_Location :
Boston, MA
Print_ISBN :
978-1-4244-1509-0
DOI :
10.1109/BIBE.2007.4375551