• DocumentCode
    2412839
  • Title

    MetaPhyler: Taxonomic profiling for metagenomic sequences

  • Author

    Liu, Bo ; Gibbons, Theodore ; Ghodsi, Mohammad ; Pop, Mihai

  • Author_Institution
    Center for Bioinf. & Comput. Biol., Univ. of Maryland, College Park, MD, USA
  • fYear
    2010
  • fDate
    18-21 Dec. 2010
  • Firstpage
    95
  • Lastpage
    100
  • Abstract
    A major goal of metagenomics is to characterize the microbial diversity of an environment. The most popular approach relies on 16S rRNA sequencing, however this approach can generate biased estimates due to differences in the copy number of the 16S rRNA gene between even closely related organisms, and due to PCR artifacts. The taxonomic composition can also be determined from whole-metagenome sequencing data by matching individual sequences against a database of reference genes. One major limitation of prior methods used for this purpose is the use of a universal classification threshold for all genes at all taxonomic levels. We propose that better classification results can be obtained by tuning the taxonomic classifier to each matching length, reference gene, and taxonomic level. We present a novel taxonomic profiler MetaPhyler, which uses marker genes as a taxonomic reference. Results on simulated datasets demonstrate that MetaPhyler outperforms other tools commonly used in this context (CARMA, Megan and PhymmBL). We also present interesting results obtained by applying MetaPhyler to a real metagenomic dataset.
  • Keywords
    bioinformatics; genomics; molecular biophysics; molecular configurations; pattern classification; pattern matching; CARMA comparison; Megan comparison; MetaPhyler; PhymmBL comparison; marker genes; matching length; metagenomic sequences; microbial diversity; reference gene database; sequence matching; taxonomic classifier; taxonomic level; taxonomic profiling; whole metagenome sequencing data; Databases; Genomics; Linear regression; Microorganisms; Phylogeny; Sensitivity; metagenomics; phylogenetic classification; taxonomic profiling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedicine (BIBM), 2010 IEEE International Conference on
  • Conference_Location
    Hong Kong
  • Print_ISBN
    978-1-4244-8306-8
  • Electronic_ISBN
    978-1-4244-8307-5
  • Type

    conf

  • DOI
    10.1109/BIBM.2010.5706544
  • Filename
    5706544