DocumentCode :
3022058
Title :
ESTmapper: Efficiently Aligning DNA Sequences to Genomes
Author :
Wu, Xue ; Lee, Woei-Jyh ; Tseng, Chau-Wen
Author_Institution :
Dept. of Comput. Sci., Maryland Univ., College Park, MD, USA
fYear :
2005
fDate :
04-08 April 2005
Abstract :
With improvements in technology, scientists are able to sequence the full DNA (genome) of an increasing number of organisms. One way biologists can take advantage of this genomic sequence data is to use it in conjunction with expressed sequence tag (EST) information to find genes and their splice sites. We describe how ESTmapper uses an eager write-only top-down (WOTD) suffix tree to efficiently align DNA sequences against genomes, and compare its precision and performance against popular techniques for DNA alignment (BLAT, sim4, Spidey, BLAST, megaBLAST) and EST clustering (TGICL and PaCE). Experimental results show that ESTmapper is 3 to 1000 times faster than current techniques for aligning and clustering DNA sequences, and produces alignments of comparable or better quality.
Keywords :
DNA; biology computing; genetics; molecular biophysics; sequences; trees (mathematics); BLAST; BLAT; DNA sequence alignment; DNA sequence clustering; EST clustering; ESTmapper tool; PaCE; Spidey; TGICL; WOTD suffix tree; expressed sequence tag information; genomic sequence data; megaBLAST; sim4; write-only top-down suffix tree; Bioinformatics; Clustering algorithms; Computer science; DNA computing; Educational institutions; Genomics; Humans; Organisms; Sequences; Splicing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
Print_ISBN :
0-7695-2312-9
Type :
conf
DOI :
10.1109/IPDPS.2005.204
Filename :
1420088
Link To Document :
بازگشت