• DocumentCode
    131934
  • Title

    A hybrid approach to identifying sentiment polarity for new words

  • Author

    Yang Yang ; Ruifan Li ; Yanquan Zhou

  • Author_Institution
    Sch. of Comput. Sci., Beijing Univ. of Posts & Telecommun., Beijing, China
  • fYear
    2014
  • fDate
    11-14 May 2014
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Microblog is a typical form of heterogeneous information. For this information, identifying sentiment polarity of new words plays a fundamental role in sentiment analysis. In this paper, we proposed a hybrid approach using both statistic and syntax information to identifying the sentiment polarity of new words. We first filter the raw tweets out some noises and segment the clean data with POS tagging. Next, we collect new words by filtering rules. Then, we assign each new word with a polarity using both statistics and patterns information. We evaluate our approach on a real dataset from Sina Weibo, achieving a relatively high F-score of 0.241 compared with the baseline of 0.22.
  • Keywords
    Web sites; dictionaries; POS tagging; Sina Weibo; heterogeneous information; microblog; sentiment analysis; syntax information; Noise measurement; dictionaries; new words; patterns; sentiment polarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Wireless Communications, Vehicular Technology, Information Theory and Aerospace & Electronic Systems (VITAE), 2014 4th International Conference on
  • Conference_Location
    Aalborg
  • Print_ISBN
    978-1-4799-4626-6
  • Type

    conf

  • DOI
    10.1109/VITAE.2014.6934435
  • Filename
    6934435