DocumentCode
2889955
Title
Approximate Common Intervals in Multiple Genome Comparison
Author
Chateau, Annie ; Riou, Pierre ; Rivals, Eric
Author_Institution
LIRMM, Univ. Montpellier 2, Montpellier, France
fYear
2011
fDate
12-15 Nov. 2011
Firstpage
131
Lastpage
134
Abstract
We consider the problem of inferring approximate common intervals of multiple genomes. Genomes are modelled as sequences of homologous genes families identifiers, and approximate common intervals represent conserved regions possibly showing rearrangements, as well as repetitions, or insertions/deletions. This problem is already known, but existing approaches are not incremental and somehow limited to special cases. We adopt a simple, classical graph-based approach, where the vertices of the graph represent the exact common intervals of the sequences (i.e., regions containing the same gene set), and where edges link vertices that differ by less than δ elements (with δ being parameter). With this model, approximate gene clusters are maximal cliques of the graph: computing them can exploit known and well designed algorithms. For a proof of concept, we applied the method to several datasets of bacterial genomes and compared the two maximal cliques algorithms, a static and a dynamic one. While being quite flexible, this approach opens the way to a combinatorial characterization of genomic rearrangements in terms of .
Keywords
biology computing; data visualisation; genomics; graph theory; molecular biophysics; approximate gene cluster; bacterial genome dataset; dynamic maximal clique algorithm; gene deletion; gene insertion; gene rearrangement; gene repetition; genes family identifier sequence; graph substructure; graph vertex; multiple genome interval; static maximal clique algorithm; Approximation algorithms; Bioinformatics; Clustering algorithms; Genomics; Heuristic algorithms; Microorganisms;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics and Biomedicine (BIBM), 2011 IEEE International Conference on
Conference_Location
Atlanta, GA
Print_ISBN
978-1-4577-1799-4
Type
conf
DOI
10.1109/BIBM.2011.96
Filename
6120422
Link To Document