• DocumentCode
    632883
  • Title

    Approaches to DNA de novo assembly

  • Author

    Sovic, Ivan ; Skala, Karolj ; Sikic, Mile

  • Author_Institution
    Centre for Inf. & Comput., Ruder Boskovic Inst., Zagreb, Croatia
  • fYear
    2013
  • fDate
    20-24 May 2013
  • Firstpage
    351
  • Lastpage
    359
  • Abstract
    DNA is the basic building block of all known life, accounting for all the diversities in nature. Determining the DNA of an individual organism is performed through a process called DNA sequencing. Although several different sequencing technologies do exist, they are limited and are able to acquire relatively short sequence reads. One of the approaches to sequencing involves randomly breaking a long DNA molecule into small fragments and sequencing only those fragments. Due to the random positioning of fragments on the source DNA, majority of them overlap, and provide the necessary information to combine them back together. The process of reconstructing the original DNA sequence from fragment reads is called DNA assembly. Assembly is a computationally very intensive process that may take days, or even weeks to produce the sequence of a more complex organism. Reconstructing a DNA sequence in the absence of a previously reconstructed reference sequence from a similar organism is called de novo assembly. De novo assembly methods currently provide the only means to discover new, previously unknown sequences, and are currently indispensable in biological research. In this paper, short descriptions of the sequencing process and the current sequencing platforms are given. DNA assembly process is thoroughly described, and the analysis of several de novo approaches used for assembly are presented. Overview and description of existing software tools are given, including some parallel implementations. As a conclusion, aspects of possible future development of DNA assembly are considered.
  • Keywords
    DNA; biology computing; DNA assembly process; DNA de novo assembly; DNA molecule; DNA sequencing; assembly; biological research; complex organism; individual organism; Assembly; Bioinformatics; DNA; Genomics; Layout; Sequential analysis; Solids; DNA sequencing; assembly; consensus; de Bruijn; layout; overlap; parallel;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information & Communication Technology Electronics & Microelectronics (MIPRO), 2013 36th International Convention on
  • Conference_Location
    Opatija
  • Print_ISBN
    978-953-233-076-2
  • Type

    conf

  • Filename
    6596281