• DocumentCode
    2454561
  • Title

    Automatic Detection of HIV Drug Resistance-Associated Mutations

  • Author

    Cheng, Betty Y. ; Carbonell, Jaime G.

  • Author_Institution
    Sch. of Comput. Sci., Carnegie Mellon Univ., Pittsburgh, PA, USA
  • fYear
    2010
  • fDate
    12-14 Dec. 2010
  • Firstpage
    528
  • Lastpage
    533
  • Abstract
    Each HIV-1 patient has a diverse population of virus strains in his/her body as the virus quickly replicates and mutates, requiring a combination drug therapy optimized to the patient´s unique viral population. Towards this goal, prediction systems have been developed to deduce the susceptibility of a given HIV genotype to a single drug. Many are rule-based systems or rely on hand-crafted features which are difficult to update for HIV strains and new drugs. We adapted the vector-of-n-grams approach from document classification and chi-square feature selection to automatically generate a feature set that yields comparable performance to the expert-selected and database-derived feature sets without requiring treatment history data. Our automatically-generated feature set also found all the expert-selected mutations and more demonstrating its potential for knowledge discovery. Compared to the previous state-of-the-art with ample expert knowledge, our best fully-automated prediction model for each drug yielded comparable performance at 82.9% classification accuracy and 0.819 coefficient of determination on average. Along with its lack of need for human expertise and potential for knowledge discovery, our automatic feature selection method is a good candidate for the more complex prediction task of combination drug therapy optimization.
  • Keywords
    data mining; drugs; medical computing; optimisation; patient diagnosis; HIV drug resistance-associated mutations; HIV genotype; automatic detection; drug therapy optimization; knowledge discovery; viral population; virus strains; Accuracy; Artificial neural networks; Classification algorithms; Classification tree analysis; Drugs; Human immunodeficiency virus; Immune system; Classification; Feature selection; Regression;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Applications (ICMLA), 2010 Ninth International Conference on
  • Conference_Location
    Washington, DC
  • Print_ISBN
    978-1-4244-9211-4
  • Type

    conf

  • DOI
    10.1109/ICMLA.2010.83
  • Filename
    5708881