Title :
Applying neural networks to classify influenza virus antigenic types and hosts
Author :
Attaluri, Pavan K. ; Chen, Zhengxin ; Lu, Guoqing
Author_Institution :
Comput. Sci. Dept., Univ. of Nebraska at Omaha, Omaha, NE, USA
Abstract :
Influenza viruses continue to evolve rapidly and are responsible for seasonal epidemics and occasional, but catastrophic, pandemics. We recently demonstrated that decision tree and support vector machine methods can classify pandemic swine flu viral strains with high accuracy. Here, we applied artificial neural networks to predict important influenza virus antigenic types (H1, H3, and H5) and hosts (Human, Avian, and Swine), addressing a critical need for a computational system for influenza surveillance. A comprehensive experiment on different k-mers and different binary encoding types showed that classification based on the frequencies of k-mer nucleotide strings performed better than classification based on binary-transformed nucleotide data. We found, for the first time, that the accuracy of virus classification varies from host to host and from gene segment to gene segment. In particular, human influenza viruses can be classified with higher accuracy than avian and swine viruses, which suggests that influenza virus strains might have become well adapted to their human host, so that less variation occurs in human viruses. In addition, the accuracy of host classification varies from genome segment to segment, reaching its highest values when the HA and NA segments are used for human host classification. This research, along with our previous studies, shows that machine learning techniques play an indispensable role in virus classification.
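The two input representations compared in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the normalization to relative frequencies, the handling of ambiguous bases, and the particular 2-bit code are all assumptions made for illustration, and the abstract does not specify which k values or binary encodings were tested.

```python
from itertools import product

NUCLEOTIDES = "ACGT"

def kmer_frequencies(sequence, k):
    """Relative frequency of each of the 4**k k-mers, in lexicographic order.

    Assumption: overlapping windows are counted, and windows containing
    ambiguous bases (for example N) are skipped.
    """
    kmers = ["".join(p) for p in product(NUCLEOTIDES, repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    seq = sequence.upper()
    total = 0
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:
            counts[window] += 1
            total += 1
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[kmer] / total for kmer in kmers]

# Hypothetical 2-bit code; the paper compares several binary encoding
# types that the abstract does not enumerate.
BINARY_CODE = {"A": (0, 0), "C": (0, 1), "G": (1, 0), "T": (1, 1)}

def binary_encode(sequence):
    """Flatten a sequence into a list of bits, skipping ambiguous bases."""
    bits = []
    for base in sequence.upper():
        if base in BINARY_CODE:
            bits.extend(BINARY_CODE[base])
    return bits
```

A k-mer vector has a fixed length of 4**k regardless of sequence length, whereas the binary vector grows with the sequence, so the latter would need alignment, padding, or truncation before being fed to a fixed-input neural network.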
Keywords :
biology computing; decision trees; microorganisms; neural nets; pattern classification; support vector machines; antigenic types; artificial neural networks; binary encoding types; decision tree; influenza virus antigenic types; influenza virus classification; k-mer nucleotide strings; pandemic swine flu viral strains classification; support vector machine method; Artificial neural networks; Capacitive sensors; Classification tree analysis; Decision trees; Humans; Influenza; Neural networks; Support vector machine classification; Support vector machines; Viruses (medical)
Conference_Title :
Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2010 IEEE Symposium on
Conference_Location :
Montreal, QC
Print_ISBN :
978-1-4244-6766-2
DOI :
10.1109/CIBCB.2010.5510726