• DocumentCode
    2414694
  • Title

    FluRF, an automated flu virus reassortment finder based on phylogenetic trees

  • Author

    Yurovsky, Alisa ; Moret, Bernard M E

  • Author_Institution
    Lab. for Comput. Biol. & Bioinf., EPFL (Swiss Fed. Inst. of Technol.), Lausanne, Switzerland
  • fYear
    2010
  • fDate
    18-21 Dec. 2010
  • Firstpage
    579
  • Lastpage
    584
  • Abstract
    Reassortments are events in the evolution of the genome of influenza (flu). As reassortments have been implicated in major human pandemics of the last century, their identification has become a health priority. While such identification can be done on a small dataset by a biologist (using phylogenetic trees), databases of flu sequences are growing exponentially, so that it is imperative to develop automated identification methods. However, current methods are limited to pairwise segment comparisons. We present FluRF, a fully automated flu virus reassortment finder. FluRF is inspired by the visual approach to reassortment identification and thus uses the reconstructed phylogenetic trees of the individual segments and the full genome. We also present a simple flu evolution simulator, used to produce synthetic datasets to tune the FluRF parameters. On synthetic datasets produced by our flu evolution simulator, FluRF, tuned for a 0% false positive rate, yielded false negative rates of 10% or less. FluRF corroborated two new reassortments identified by visual analysis of 75 Human H3N2 New York flu strains from 2005-2008 and gave partial verification of reassortments found using another bioinformatics method. FluRF finds reassortments by a bottom-up search of the full-genome and segment-based phylogenetic trees for candidate clades-groups of one or more sampled viruses that are separated from the other variants from the same season. Candidate clades in each tree are tested to guarantee confidence values, using the lengths of key edges as well as other tree parameters; clades with reassortments must have validated incongruencies among segment trees. FluRF demonstrates robustness of prediction for geographically and temporally expanded datasets, and is not limited to finding reassortments with previously collected sequences. The complete source code is available from lcbb.epfl.ch/software.html.
  • Keywords
    bioinformatics; cellular biophysics; data analysis; genomics; microorganisms; molecular biophysics; H3N2 New York flu strains; automated flu virus reassortment finder; bioinformatics; dataset; flu sequence; genome; influenza; phylogenetic trees; Bioinformatics; Genomics; Humans; Inspection; Phylogeny; Strain; Visualization; algorithms; biology; microorganisms; modeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-8306-8
  • Electronic_ISBN
    978-1-4244-8307-5
  • Type

    conf

  • DOI
    10.1109/BIBM.2010.5706632
  • Filename
    5706632