• DocumentCode
    86527
  • Title

    An Improved Approximation Algorithm for Scaffold Filling to Maximize the Common Adjacencies

  • Author

    Nan Liu ; Haitao Jiang ; Daming Zhu ; Binhai Zhu

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
  • Volume
    10
  • Issue
    4
  • fYear
    2013
  • fDate
    July-Aug. 2013
  • Firstpage
    905
  • Lastpage
    913
  • Abstract
    Scaffold filling is a new combinatorial optimization problem in genome sequencing. The one-sided scaffold filling problem can be described as given an incomplete genome I and a complete (reference) genome G, fill the missing genes into I such that the number of common (string) adjacencies between the resulting genome I´ and G is maximized. This problem is NP-complete for genome with duplicated genes and the best known approximation factor is 1.33, which uses a greedy strategy. In this paper, we prove a better lower bound of the optimal solution, and devise a new algorithm by exploiting the maximum matching method and a local improvement technique, which improves the approximation factor to 1.25. For genome with gene repetitions, this is the only known NP-complete problem which admits an approximation with a small constant factor (less than 1.5).
  • Keywords
    genetics; genomics; optimisation; NP-complete problem; common adjacencies; genome sequence; greedy strategy; maximum matching method; one-sided scaffold filling problem; optimization problem; Algorithm design and analysis; Approximation algorithms; Approximation methods; Bioinformatics; Educational institutions; Genomics; Sequential analysis; Comparative genomics; NP-completeness; adjacencies; algorithms; breakpoints; scaffold filling;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2013.100
  • Filename
    6582406