• DocumentCode
    2887999
  • Title

    Learning Domain-Specific Polarity Lexicons

  • Author

    Demiroz, Gulsen ; Yanikoglu, Benin ; Tapucu, D. ; Saygin, Yucel

  • Author_Institution
    Fac. of Eng. & Natural Sci., Sabanci Univ., Istanbul, Turkey
  • fYear
    2012
  • fDate
    10-10 Dec. 2012
  • Firstpage
    674
  • Lastpage
    679
  • Abstract
    Sentiment analysis aims to automatically estimate the sentiment in a given text as positive or negative. Polarity lexicons, often used in sentiment analysis, indicate how positive or negative each term in the lexicon is. However, since creating domain-specific polarity lexicons is expensive and time consuming, researchers often use a general purpose or domain independent lexicon. In this work, we address the problem of adapting a general purpose polarity lexicon to a specific domain and propose a simple yet effective adaptation algorithm. We experimented with two sets of reviews from the hotel and movie domains and observed that while our adaptation techniques changed the polarity values for only a small set of words, the overall test accuracy increased significantly: 77% to 83% in the hotel dataset and 61% to 66% in the movie dataset.
  • Keywords
    data mining; learning (artificial intelligence); natural language processing; text analysis; adaptation algorithm; automatic sentiment estimation; domain-specific polarity lexicon learning; general purpose polarity lexicon; hotel dataset; hotel reviews; movie dataset; movie reviews; natural language processing; sentiment analysis; text analysis; Accuracy; Computational linguistics; Databases; Feature extraction; Motion pictures; Training; USA Councils; lexicon adaptation; machine learning; natural language processing; polarity detection; sentiment analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • Print_ISBN
    978-1-4673-5164-5
  • Type

    conf

  • DOI
    10.1109/ICDMW.2012.120
  • Filename
    6406504