Title :
gWord: A Tool for Genome-Wide Word Search and Count
Author :
Xie Jianming ; Sun Xiao ; Lu Zhiyuan ; Xue Weiyan ; Dong Xianjun ; Lu Zuhong
Author_Institution :
State Key Lab. of Bioelectronics, Southeast Univ., Nanjing
Abstract :
Word search and count in whole genome DNA sequence is very important for re-sequencing an organism´s genome DNA using the microarray based technology and studying the functional element´s function based on genome- wide approach. A stand-alone program named gWord (abbr. of genome Word), which applies a fast algorithm to rapidly map the n-mer word into an index in memory, is developed for the task that can fulfill two main functions: counting all possible n-mer short DNA sequences in genome and acquiring the locations of one motif or those words presented only once in genome DNA. In addition, the search hits of any word will be annotated with gene information. Two examples on human genome are given to demonstrate the application of gWord in genomics research.
Keywords :
DNA; arrays; biology computing; cellular biophysics; genetics; molecular biophysics; gWord; genome Word; genome-wide word search; human genome; microarray; whole genome DNA sequence; Bioinformatics; Books; DNA; Genetics; Genomics; Humans; Laboratories; Organisms; Sequences; Sun;
Conference_Titel :
Bioinformatics and Biomedical Engineering, 2007. ICBBE 2007. The 1st International Conference on
Conference_Location :
Wuhan
Print_ISBN :
1-4244-1120-3
DOI :
10.1109/ICBBE.2007.84