DocumentCode
2452575
Title
Comparative Assessment of DNA Assemblers for Assembling Expressed Sequence Tags
Author
Min, Xiangjia J. ; Butler, Gregory ; Storms, Reginald ; Tsang, Adrian
Author_Institution
Dept. of Biol. Sci., Youngstown State Univ., Youngstown, OH, USA
fYear
2009
fDate
15-17 June 2009
Firstpage
79
Lastpage
82
Abstract
Assembling expressed sequence tags (ESTs) is essential for removing redundancy and generating long virtual transcripts for EST annotation and gene finding. A number of assemblers are available, but there is a lack of detailed comparative assessment of the strength and weakness of these assemblers. We compared three assemblers including Phrap, CAP3 and TIGR Assembler (TA) using Aspergillus niger and Phanerochaete chrysosporium EST data. Phrap assembled more ESTs into contigs than TA and CAP3. Among the contigs and singletons generated by the three assemblers, 67 - 90% of them were identical. The number of contigs and singletons assembled by Phrap provides an estimate of the maximum number of unique genes represented in the dataset, while the numbers generated by TA and CAP3 provide an approximate estimate of unique transcripts since both TA and CAP are more discriminating to alternatively spliced transcripts. The error rate in contigs generated by Phrap was slightly higher than contigs generated by TA or CAP3. Phrap is thus recommended for EST assembling aiming at generating a set of unisequences with minimum redundancy for estimating the unigene number, and TA or CAP3 are used for assembling ESTs aiming at finding unique transcripts, i. e., for identification of alternative splicing.
Keywords
DNA; bioinformatics; genetics; genomics; molecular biophysics; Aspergillus niger; CAP3; DNA assemblers; EST annotation; Phanerochaete chrysosporium; Phrap; TIGR Assembler; expressed sequence tags; gene finding; long virtual transcripts; redundancy; Assembly; Bioinformatics; Biology; Collaboration; DNA; Genomics; Organisms; Redundancy; Sequences; Splicing; EST; alternative splicing; assembler; transcript;
fLanguage
English
Publisher
ieee
Conference_Titel
Bioinformatics, 2009. OCCBIO '09. Ohio Collaborative Conference on
Conference_Location
Cleveland, OH
Print_ISBN
978-0-7695-3685-9
Type
conf
DOI
10.1109/OCCBIO.2009.19
Filename
5159166
Link To Document