Title :
Exemplar Longest Common Subsequence
Author :
Bonizzoni, Paola ; Della Vedova, Gianluca ; Dondi, Riccardo ; Fertin, Guillaume ; Rizzi, Raffaella ; Vialette, Stéphane
Author_Institution :
Univ. degli Studi di Milano-Bicocca, Milan
Abstract :
In this paper, we investigate the computational and approximation complexity of the Exemplar Longest Common Subsequence (ELCS) of a set of sequences (ELCS problem), a generalization of the Longest Common Subsequence problem, where the input sequences are over the union of two disjoint sets of symbols, a set of mandatory symbols and a set of optional symbols. We show that different versions of the problem are APX-hard even for instances with two sequences. Moreover, we show that the related problem of determining the existence of a feasible solution of the ELCS of two sequences is NP-hard. On the positive side, we first present an efficient algorithm for the ELCS problem over instances of two sequences where each mandatory symbol can appear in total at most three times in the sequences. Furthermore, we present two fixed-parameter algorithms for the ELCS problem over instances of two sequences where the parameter is the number of mandatory symbols.
Keywords :
biology computing; computational complexity; genetics; greedy algorithms; sequences; string matching; APX-hard; NP-hard; approximation complexity; computational complexity; constrained string alignment problem; exemplar longest common subsequence; fixed-parameter algorithm; genomic string; greedy heuristic algorithm; Longest common subsequence; algorithm design and analysis; analysis of algorithms and problem complexity; combinatorial algorithms; comparative genomics; Algorithms; Computational Biology; Computers; Data Interpretation, Statistical; Models, Statistical; Models, Theoretical; Sequence Analysis, DNA; Software;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2007.1066