Title :
Typing Staphylococcus aureus Using the spa Gene and Novel Distance Measures
Author :
Agius, Phaedra ; Kreiswirth, Barry N. ; Naidich, Steve ; Bennett, Kristin P.
Author_Institution :
Bristol Univ., Bristol
Abstract :
We developed an approach for identifying groups or families of Staphylococcus aureus bacteria based on genotype data. With the emergence of drug-resistant strains, S. aureus represents a significant human health threat. Identifying the family types efficiently and quickly is crucial in community settings. Here, we develop a hybrid sequence algorithm approach to type this bacterium using only its spa gene. Two of the sequence algorithms we used are well established, whereas the third, the best common gap-weighted sequence (BCGS), is novel. We combined the sequence algorithms with a weighted match/mismatch algorithm for the spa sequence ends. Normalized similarity scores and distances between the sequences were derived and used within unsupervised clustering methods. The resulting spa groupings correlated strongly with the groups defined by the well-established multilocus sequence typing (MLST) method, spa typing is preferable to MLST typing, which types seven genes instead of just one. Furthermore, our spa clustering methods can be fine-tuned to be more discriminating than MLST, identifying new strains that the MLST method may not. Finally, we performed a multidimensional scaling of our distance matrices to visualize the relationship between isolates. The proposed methodology provides a promising new approach to molecular epidemiology.
Keywords :
DNA; biology computing; diseases; genetics; microorganisms; molecular biophysics; pattern clustering; DNA sequences; Staphylococcus aureus bacteria; best common gap-weighted sequence; drug-resistant strains; genotype data; human health threat; hybrid sequence algorithm approach; molecular epidemiology; multidimensional scaling; multilocus sequence typing method; spa gene; unsupervised clustering methods; clustering; genotyping; moleuclar epidemiology; sequence algorithms; staphylococcus aureus; Algorithms; Bacterial Typing Techniques; Cluster Analysis; Computational Biology; Epidemiology, Molecular; Gene Expression Profiling; Gene Expression Regulation, Bacterial; Genotype; Models, Genetic; Models, Statistical; Models, Theoretical; Oligonucleotide Array Sequence Analysis; Protein Interaction Mapping; Reproducibility of Results; Software; Staphylococcus aureus;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/tcbb.2007.1053