• DocumentCode
    3022058
  • Title

    ESTmapper: Efficiently Aligning DNA Sequences to Genomes

  • Author

    Wu, Xue ; Lee, Woei-Jyh ; Tseng, Chau-Wen

  • Author_Institution
    Dept. of Comput. Sci., Maryland Univ., College Park, MD, USA
  • fYear
    2005
  • fDate
    04-08 April 2005
  • Abstract
    With improvements in technology, scientists are able to sequence the full DNA (genome) of an increasing number of organisms. One way biologists can take advantage of this genomic sequence data is to use it in conjunction with expressed sequence tag (EST) information to find genes and their splice sites. We describe how ESTmapper uses an eager write-only top-down (WOTD) suffix tree to efficiently align DNA sequences against genomes, and compare its precision and performance against popular techniques for DNA alignment (BLAT, sim4, Spidey, BLAST, megaBLAST) and EST clustering (TGICL and PaCE). Experimental results show that ESTmapper is 3 to 1000 times faster than current techniques for aligning and clustering DNA sequences, and produces alignments of comparable or better quality.
  • Keywords
    DNA; biology computing; genetics; molecular biophysics; sequences; trees (mathematics); BLAST; BLAT; DNA sequence alignment; DNA sequence clustering; EST clustering; ESTmapper tool; PaCE; Spidey; TGICL; WOTD suffix tree; expressed sequence tag information; genomic sequence data; megaBLAST; sim4; write-only top-down suffix tree; Bioinformatics; Clustering algorithms; Computer science; DNA computing; Educational institutions; Genomics; Humans; Organisms; Sequences; Splicing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel and Distributed Processing Symposium, 2005. Proceedings. 19th IEEE International
  • Print_ISBN
    0-7695-2312-9
  • Type

    conf

  • DOI
    10.1109/IPDPS.2005.204
  • Filename
    1420088