• DocumentCode
    112473
  • Title

    Understanding User Intents in Online Health Forums

  • Author

    Zhang, Thomas ; Cho, Jason H. D. ; Chengxiang Zhai

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois at Urbana-Champaign, Urbana, IL, USA
  • Volume
    19
  • Issue
    4
  • fYear
    2015
  • fDate
    Jul-15
  • Firstpage
    1392
  • Lastpage
    1398
  • Abstract
    Online health forums provide a convenient way for patients to obtain medical information and connect with physicians and peers outside of clinical settings. However, large quantities of unstructured and diversified content generated on these forums make it difficult for users to digest and extract useful information. Understanding user intents would enable forums to find and recommend relevant information to users by filtering out threads that do not match particular intents. In this paper, we derive a taxonomy of intents to capture user information needs in online health forums and propose novel pattern-based features for use with a multiclass support vector machine (SVM) classifier to classify original thread posts according to their underlying intents. Since no dataset existed for this task, we employ three annotators to manually label a dataset of 1192 HealthBoards posts spanning four forum topics. Experimental results show that a SVM using pattern-based features is highly capable of identifying user intents in forum posts, reaching a maximum precision of 75%, and that a SVM-based hierarchical classifier using both pattern and word features outperforms its SVM counterpart that uses only word features. Furthermore, comparable classification performance can be achieved by training and testing on posts from different forum topics.
  • Keywords
    medical information systems; support vector machines; HealthBoards posts; SVM; hierarchical classifier; medical information; multiclass support vector machine classifier; online health forums; pattern-based features; user information needs; user intents; word features; Drugs; Message systems; Semantics; Support vector machines; Taxonomy; Training; Forum intents; online health forums; pattern based features; support vector machines; user intent classification;
  • fLanguage
    English
  • Journal_Title
    Biomedical and Health Informatics, IEEE Journal of
  • Publisher
    ieee
  • ISSN
    2168-2194
  • Type

    jour

  • DOI
    10.1109/JBHI.2015.2416252
  • Filename
    7066225