DocumentCode
1694272
Title
A New Progressive Multiple Sequence Alignment Algorithm
Author
Hosni, Soumaya ; Mokaddem, Ahmed ; Elloumi, Mourad
Author_Institution
Lab. of Technol. of Inf. & Commun. & Electr. Eng. (LaTICE), Tunis Univ. of Tunis, Tunis, Tunisia
fYear
2012
Firstpage
195
Lastpage
198
Abstract
In this paper, we present a new progressive algorithm for Multiple Sequence Alignment (MSA). Our algorithm adopts a new distance, called anchor distance to compute the distance between two protein sequences, and a variant of the UPGMA method to construct a guide tree. Our algorithm implements also a refinement step. Our algorithm is of complexity O(N4+N*L2) in computing time, where N is the number of the sequences and L is the length of the longest sequence. We asses our algorithm on different protein benchmarks, e.g., BALIBASE, HOMSTRAD, OXBENCH, and we compared the obtained results to those obtained with other alignment algorithms, e.g., CLUSTALW, MUSCLE, MAFFT and PROBCONS, using the Column Score (CS) and the Sum of Pairs Score (SPS). We obtained good results.
Keywords
biology computing; computational complexity; proteins; BALIBASE; CLUSTALW; CS; HOMSTRAD; MAFFT; MSA; MUSCLE; O(N4+N*L2) complexity; OXBENCH; PROBCONS; SPS; UPGMA method; alignment algorithms; anchor distance; column score; guide tree; progressive multiple sequence alignment algorithm; protein benchmarks; protein sequences; sum of pairs score; Benchmark testing; Bioinformatics; Databases; Muscles; Proteins; Multiple sequence alignment; algorithms; distances; progressive alignment;
fLanguage
English
Publisher
ieee
Conference_Titel
Database and Expert Systems Applications (DEXA), 2012 23rd International Workshop on
Conference_Location
Vienna
ISSN
1529-4188
Print_ISBN
978-1-4673-2621-6
Type
conf
DOI
10.1109/DEXA.2012.8
Filename
6327425
Link To Document