• DocumentCode
    2452575
  • Title

    Comparative Assessment of DNA Assemblers for Assembling Expressed Sequence Tags

  • Author

    Min, Xiangjia J. ; Butler, Gregory ; Storms, Reginald ; Tsang, Adrian

  • Author_Institution
    Dept. of Biol. Sci., Youngstown State Univ., Youngstown, OH, USA
  • fYear
    2009
  • fDate
    15-17 June 2009
  • Firstpage
    79
  • Lastpage
    82
  • Abstract
    Assembling expressed sequence tags (ESTs) is essential for removing redundancy and generating long virtual transcripts for EST annotation and gene finding. A number of assemblers are available, but there is a lack of detailed comparative assessment of the strength and weakness of these assemblers. We compared three assemblers including Phrap, CAP3 and TIGR Assembler (TA) using Aspergillus niger and Phanerochaete chrysosporium EST data. Phrap assembled more ESTs into contigs than TA and CAP3. Among the contigs and singletons generated by the three assemblers, 67 - 90% of them were identical. The number of contigs and singletons assembled by Phrap provides an estimate of the maximum number of unique genes represented in the dataset, while the numbers generated by TA and CAP3 provide an approximate estimate of unique transcripts since both TA and CAP are more discriminating to alternatively spliced transcripts. The error rate in contigs generated by Phrap was slightly higher than contigs generated by TA or CAP3. Phrap is thus recommended for EST assembling aiming at generating a set of unisequences with minimum redundancy for estimating the unigene number, and TA or CAP3 are used for assembling ESTs aiming at finding unique transcripts, i. e., for identification of alternative splicing.
  • Keywords
    DNA; bioinformatics; genetics; genomics; molecular biophysics; Aspergillus niger; CAP3; DNA assemblers; EST annotation; Phanerochaete chrysosporium; Phrap; TIGR Assembler; expressed sequence tags; gene finding; long virtual transcripts; redundancy; Assembly; Bioinformatics; Biology; Collaboration; DNA; Genomics; Organisms; Redundancy; Sequences; Splicing; EST; alternative splicing; assembler; transcript;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics, 2009. OCCBIO '09. Ohio Collaborative Conference on
  • Conference_Location
    Cleveland, OH
  • Print_ISBN
    978-0-7695-3685-9
  • Type

    conf

  • DOI
    10.1109/OCCBIO.2009.19
  • Filename
    5159166