• DocumentCode
    2923360
  • Title
    Designing antimicrobial peptides with weighted finite-state transducers
  • Author
    Whelan, Christopher ; Roark, Brian ; Sönmez, Kemal
  • Author_Institution
    Center for Spoken Language Understanding, Oregon Health & Sci. Univ., Portland, OR, USA
  • fYear
    2010
  • fDate
    Aug. 31 2010-Sept. 4 2010
  • Firstpage
    764
  • Lastpage
    767
  • Abstract
    The design of novel antimicrobial peptides (AMPs) is an important problem given the rise of drug-resistant bacteria. However, the large size of the sequence search space, combined with the time required to experimentally test or simulate AMPs at the molecular level, makes computational approaches based on sequence analysis attractive. We propose a method for designing novel AMPs that learns from n-gram counts of classes of amino acid residues and then uses weighted finite-state machines to produce sequences incorporating the features most strongly associated with AMP sequences. Finite-state machines are able to generate sequences that include desired n-gram features. We use this approach to generate candidate novel AMPs, which we test using third-party prediction servers. We demonstrate that our framework is capable of producing large numbers of novel peptide sequences that share features with known antimicrobial peptides.
  • Keywords
    antibacterial activity; biochemistry; bioinformatics; drugs; finite state machines; genetics; microorganisms; molecular biophysics; AMP sequences; amino acid residues; antimicrobial peptides; drug-resistant bacteria; n-gram counts; sequence analysis; sequence search space; third-party prediction servers; weighted finite-state machines; weighted finite-state transducers; Amino acids; Databases; Kernel; Peptides; Support vector machines; Training; Transducers; Algorithms; Amino Acid Sequence; Antimicrobial Cationic Peptides; Artificial Intelligence; Computer Simulation; Drug Design; Models, Chemical; Molecular Sequence Data; Pattern Recognition, Automated; Sequence Analysis, Protein
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Engineering in Medicine and Biology Society (EMBC), 2010 Annual International Conference of the IEEE
  • Conference_Location
    Buenos Aires
  • ISSN
    1557-170X
  • Print_ISBN
    978-1-4244-4123-5
  • Type
    conf
  • DOI
    10.1109/IEMBS.2010.5626357
  • Filename
    5626357