Title :
Progressive alignment using Shortest Common Supersequence
Author :
Garg, Adesh ; Garg, Deepak
Author_Institution :
Dept. of Comput. Sci. & Eng., Thapar Univ., Patiala, India
Abstract :
Multiple Sequence Alignment is an NP-hard problem. The complexity of finding the optimal alignment is O(LN) where L is the length of the longest sequence and N is the number of sequences. Hence the optimal solution is nearly impossible for most of the datasets. Progressive alignment solves MSA in very economic complexity but does not provide accurate solutions because there is a tradeoff between accuracy and complexity. Guide tree that guides the alignment of sequences is generated by alignment score in progressive alignment. In this paper, Shortest Common Supersequence (SCS) is utilized to generate the guide tree for progressive alignment and the output alignment results are checked by BAliBASE benchmarks for accuracy. According to SP and TC scores, progressive alignment using the guide tree generated by SCS is better than the guide tree generated by alignment score. Original ClustalW2.1 is modified by SCS, and modified ClustalW2.1 gives better results than the original tool.
Keywords :
bioinformatics; computational complexity; genetics; optimisation; trees (mathematics); DNA sequences; NP-hard problem; SCS; guide tree; multiple sequence alignment; progressive alignment; shortest common supersequence; Accuracy; Bioinformatics; Complexity theory; DNA; Hidden Markov models; Proteins; Vegetation; BAliBASE; ClustalW2; Multiple sequence alignment; Progressive alignment; SCS;
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-3078-4
DOI :
10.1109/ICACCI.2014.6968310