DocumentCode
3022058
Title
ESTmapper: Efficiently Aligning DNA Sequences to Genomes
Author
Wu, Xue ; Lee, Woei-Jyh ; Tseng, Chau-Wen
Author_Institution
Dept. of Comput. Sci., Maryland Univ., College Park, MD, USA
fYear
2005
fDate
04-08 April 2005
Abstract
With improvements in technology, scientists are able to sequence the full DNA (genome) of an increasing number of organisms. One way biologists can take advantage of this genomic sequence data is to use it in conjunction with expressed sequence tag (EST) information to find genes and their splice sites. We describe how ESTmapper uses an eager write-only top-down (WOTD) suffix tree to efficiently align DNA sequences against genomes, and compare its precision and performance against popular techniques for DNA alignment (BLAT, sim4, Spidey, BLAST, megaBLAST) and EST clustering (TGICL and PaCE). Experimental results show that ESTmapper is 3 to 1000 times faster than current techniques for aligning and clustering DNA sequences, and produces alignments of comparable or better quality.
Keywords
DNA; biology computing; genetics; molecular biophysics; sequences; trees (mathematics); BLAST; BLAT; DNA sequence alignment; DNA sequence clustering; EST clustering; ESTmapper tool; PaCE; Spidey; TGICL; WOTD suffix tree; expressed sequence tag information; genomic sequence data; megaBLAST; sim4; write-only top-down suffix tree; Bioinformatics; Clustering algorithms; Computer science; DNA computing; Educational institutions; Genomics; Humans; Organisms; Sequences; Splicing;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN
0-7695-2312-9
Type
conf
DOI
10.1109/IPDPS.2005.204
Filename
1420088
Link To Document