• DocumentCode
    538045
  • Title

    “Beautiful picture of an ugly place”. Exploring photo collections using opinion and sentiment analysis of user comments

  • Author

    Kisilevich, Slava ; Rohrdant, Christian ; Keim, Daniel

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Univ. of Konstanz, Konstanz, Germany
  • fYear
    2010
  • fDate
    18-20 Oct. 2010
  • Firstpage
    419
  • Lastpage
    428
  • Abstract
    User generated content in the form of customer reviews, feedbacks and comments plays an important role in all types of Internet services and activities like news, shopping, forums and blogs. Therefore, the analysis of user opinions is potentially beneficial for the understanding of user attitudes or the improvement of various Internet services. In this paper, we propose a practical unsupervised approach to improve user experience when exploring photo collections by using opinions and sentiments expressed in user comments on the uploaded photos. While most existing techniques concentrate on binary (negative or positive) opinion orientation, we use a real-valued scale for modeling opinion and sentiment strengths. We extract two types of sentiments: opinions that relate to the photo quality and general sentiments targeted towards objects depicted on the photo. Our approach combines linguistic features for part of speech tagging, traditional statistical methods for modeling word importance in the photo comment corpora (in a real-valued scale), and a predefined sentiment lexicon for detecting negative and positive opinion orientation. In addition, a semi-automatic photo feature detection method is applied and a set of syntactic patterns is introduced to resolve opinion references. We implemented a prototype system that incorporates the proposed approach and evaluates it on several regions in the World using real data extracted from Flickr.
  • Keywords
    Internet; behavioural sciences computing; feature extraction; information retrieval; software prototyping; user modelling; Flickr; Internet services; binary opinion orientation; data extraction; linguistic features; photo collections; photo comment corpora; photo quality; practical unsupervised approach; predefined sentiment lexicon; prototype system; real-valued scale; semiautomatic photo feature detection method; speech tagging; syntactic patterns; traditional statistical methods; user attitudes; user experience; user opinions; word importance modeling; Blogs; Dictionaries; Feature extraction; Frequency measurement; Internet; Motion pictures; Syntactics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
  • Conference_Location
    Wisla
  • ISSN
    2157-5525
  • Print_ISBN
    978-1-4244-6432-6
  • Type

    conf

  • DOI
    10.1109/IMCSIT.2010.5679726
  • Filename
    5679726