DocumentCode :
3622095
Title :
Mining Plausible Patterns from Genomic Data
Author :
J. Klema;A. Soulet;B. Cremilleux;S. Blachon;O. Gandrillon
Author_Institution :
Université
fYear :
2006
fDate :
6/28/1905 12:00:00 AM
Firstpage :
183
Lastpage :
190
Abstract :
The discovery of biologically interpretable knowledge from gene expression data is one of the largest contemporary genomic challenges. As large volumes of expression data are being generated, there is a great need for automated tools that provide the means to analyze them. However, the same tools can provide an overwhelming number of candidate hypotheses which can hardly be manually exploited by an expert. An additional knowledge helping to focus automatically on the most plausible candidates only can up-value the experiment significantly. Background knowledge available in literature databases, biological ontologies and other sources can be used for this purpose. In this paper we propose and verify a methodology that enables to effectively mine and represent meaningful over-expression patterns. Each pattern represents a bi-set of a gene group over-expressed in a set of biological situations. The originality of the framework consists in its constraint-based nature and an effective cross-fertilization of constraints based on expression data and background knowledge. The result is a limited set of candidate patterns that are most likely interpretable by biologists. Supplemental automatic interpretations serve to ease this process. Various constraints can generate plausible pattern sets of different characteristics
Keywords :
"Genomics","Bioinformatics","Biological information theory","Gene expression","Frequency","Data mining","Databases","Ontologies","Character generation","Pattern analysis"
Publisher :
ieee
Conference_Titel :
Computer-Based Medical Systems, 2006. CBMS 2006. 19th IEEE International Symposium on
ISSN :
1063-7125
Print_ISBN :
0-7695-2517-1
Type :
conf
DOI :
10.1109/CBMS.2006.116
Filename :
1647566
Link To Document :
بازگشت