DocumentCode :
3046765
Title :
Identification of mycobacterium species using curated custom databases
Author :
Kuyper, Dan ; Ali, Hesham H. ; Mohamed, Amr M. ; Hinrichs, Steven H.
Author_Institution :
Dept. of Comput. Sci., Nebraska Univ., Omaha, NE, USA
fYear :
2004
fDate :
26-30 April 2004
Firstpage :
191
Abstract :
Summary form only given. Advances in molecular biology have resulted in the development of diagnostic tests for infectious diseases based on genetic profiles. While probe-based assays dominate the field today, sequence-based assays hold great promise for the future. However, the variable quality of the sequence information currently present in public databases limits the potential growth and use of sequence-based analysis. To address this problem, a standardized method for DNA sequence validation and for building custom databases was developed, using mycobacterium as a development model. With this model, a computational approach to the identification of infectious diseases was developed and evaluated. The Web-based application, termed BioDatabase, accomplished genetic sequence identification through the creation of curated databases containing a relatively small set of genetic data specific to a species or group. The process for creating the custom database included multiple steps, beginning with the identification of highly conserved start and end sequences and of validation parameters for the intervening sequence. The process eliminated the need for multiple sequence alignment with GenBank sequences, whose information is valuable yet difficult to use properly because of its size and quality. The custom database approach maximized application performance with minimal impact on analysis response time, allowing investigation of optimal sequences for identification of all mycobacteria to the species level. In a comparison of the 16S and ITS genetic regions, a curated ITS-based approach proved most effective for identification of mycobacterium isolates.
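The curation step described in the abstract (locate conserved start and end sequences, then validate the intervening sequence) can be illustrated with a minimal Python sketch. This is not the BioDatabase implementation, which the abstract does not detail. The function names, the example motifs, the length bounds, and the ambiguous-base threshold are all hypothetical and chosen only for illustration.

```python
from typing import Dict, Optional


def extract_region(seq: str, start_motif: str, end_motif: str) -> Optional[str]:
    """Return the sequence between a conserved start and end motif, or None.

    Hypothetical helper: the motifs stand in for the "highly conserved
    start and end sequences" mentioned in the abstract.
    """
    seq = seq.upper()
    start = seq.find(start_motif.upper())
    if start < 0:
        return None
    region_begin = start + len(start_motif)
    end = seq.find(end_motif.upper(), region_begin)
    if end < 0:
        return None
    return seq[region_begin:end]


def is_valid(region: str, min_len: int, max_len: int,
             max_ambiguous_fraction: float = 0.0) -> bool:
    """Apply assumed validation parameters: length bounds and base quality."""
    if not region or not (min_len <= len(region) <= max_len):
        return False
    # Count any character that is not an unambiguous base (e.g. N).
    ambiguous = sum(1 for base in region if base not in "ACGT")
    return ambiguous / len(region) <= max_ambiguous_fraction


def build_custom_database(records: Dict[str, str], start_motif: str,
                          end_motif: str, min_len: int,
                          max_len: int) -> Dict[str, str]:
    """Keep only records whose intervening region passes validation."""
    database = {}
    for name, seq in records.items():
        region = extract_region(seq, start_motif, end_motif)
        if region is not None and is_valid(region, min_len, max_len):
            database[name] = region
    return database


# Toy input: made-up sequences, not real mycobacterial data.
records = {
    "isolate_a": "TTGGATCCACGTACGTGAATTCAA",   # clean region between motifs
    "isolate_b": "TTGGATCCACNNACGTGAATTCAA",   # region contains ambiguous bases
    "isolate_c": "ACGT",                       # motifs absent
}
curated = build_custom_database(records, "GGATCC", "GAATTC", min_len=4, max_len=20)
```

In this toy run only `isolate_a` is retained. A query sequence could then be compared against the small curated set rather than against a full public database, which is the performance argument the abstract makes.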
Keywords :
DNA; biology computing; database management systems; diseases; genetic algorithms; microorganisms; molecular biophysics; BioDatabase; DNA sequence validation; GenBank sequence; Web-based application; curated custom database; diagnostic test; genetic profile; genetic sequence identification; infectious diseases; molecular biology; mycobacterium species identification; probe based assay; public database; sequence based assay; sequence validation parameter; Computational modeling; DNA; Databases; Diseases; Genetics; Information analysis; Performance analysis; Probes; Sequences; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Parallel and Distributed Processing Symposium, 2004. Proceedings. 18th International
Print_ISBN :
0-7695-2132-0
Type :
conf
DOI :
10.1109/IPDPS.2004.1303209
Filename :
1303209