• DocumentCode
    166604
  • Title

    Semantic Properties of Customer Sentiment in Tweets

  • Author

    Eun Hee Ko ; Klabjan, Diego

  • Author_Institution
    Program of Analytics, Northwestern Univ., Evanston, IL, USA
  • fYear
    2014
  • fDate
    13-16 May 2014
  • Firstpage
    657
  • Lastpage
    663
  • Abstract
    An increasing number of people are using online social networking services (SNSs), and a significant amount of information related to experiences in consumption is shared in this new media form. Text mining is an emerging technique for mining useful information from the web. We aim at discovering in particular tweets semantic patterns in consumers´ discussions on social media. Specifically, the purposes of this study are twofold: 1) finding similarity and dissimilarity between two sets of textual documents that include consumers´ sentiment polarities, two forms of positive vs. negative opinions and 2) driving actual content from the textual data that has a semantic trend. The considered tweets include consumers´ opinions on US retail companies (e.g., Amazon, Walmart). Cosine similarity and K-means clustering methods are used to achieve the former goal, and Latent Dirichlet Allocation (LDA), a popular topic modeling algorithm, is used for the latter purpose. This is the first study which discover semantic properties of textual data in consumption context beyond sentiment analysis. In addition to major findings, we apply LDA (Latent Dirichlet Allocations) to the same data and drew latent topics that represent consumers´ positive opinions and negative opinions on social media.
  • Keywords
    consumer behaviour; data mining; pattern clustering; retail data processing; social networking (online); text analysis; K-means clustering methods; Twitter; US retail companies; consumer opinions; consumer sentiment polarities; cosine similarity; customer sentiment semantic properties; latent Dirichlet allocation; online social networking services; sentiment analysis; text mining; textual data semantic properties; textual documents; topic modeling algorithm; tweet semantic patterns; Business; Correlation; Data mining; Media; Semantics; Tagging; Vectors; text analytics; tweet analysis; document similarity; clustering; topic modeling; part-of-speech tagging;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advanced Information Networking and Applications Workshops (WAINA), 2014 28th International Conference on
  • Conference_Location
    Victoria, BC
  • Print_ISBN
    978-1-4799-2652-7
  • Type

    conf

  • DOI
    10.1109/WAINA.2014.151
  • Filename
    6844713