DocumentCode
538045
Title
“Beautiful picture of an ugly place”. Exploring photo collections using opinion and sentiment analysis of user comments
Author
Kisilevich, Slava ; Rohrdantz, Christian ; Keim, Daniel
Author_Institution
Dept. of Comput. & Inf. Sci., Univ. of Konstanz, Konstanz, Germany
fYear
2010
fDate
18-20 Oct. 2010
Firstpage
419
Lastpage
428
Abstract
User-generated content in the form of customer reviews, feedback and comments plays an important role in all types of Internet services and activities, such as news, shopping, forums and blogs. Therefore, the analysis of user opinions is potentially beneficial for understanding user attitudes or improving various Internet services. In this paper, we propose a practical unsupervised approach to improve user experience when exploring photo collections by using opinions and sentiments expressed in user comments on the uploaded photos. While most existing techniques concentrate on binary (negative or positive) opinion orientation, we use a real-valued scale for modeling opinion and sentiment strengths. We extract two types of sentiments: opinions that relate to the photo quality and general sentiments targeted towards objects depicted in the photo. Our approach combines linguistic features for part-of-speech tagging, traditional statistical methods for modeling word importance in the photo comment corpora (on a real-valued scale), and a predefined sentiment lexicon for detecting negative and positive opinion orientation. In addition, a semi-automatic photo feature detection method is applied and a set of syntactic patterns is introduced to resolve opinion references. We implemented a prototype system that incorporates the proposed approach and evaluated it on several regions of the world using real data extracted from Flickr.
Keywords
Internet; behavioural sciences computing; feature extraction; information retrieval; software prototyping; user modelling; Flickr; Internet services; binary opinion orientation; data extraction; linguistic features; photo collections; photo comment corpora; photo quality; practical unsupervised approach; predefined sentiment lexicon; prototype system; real-valued scale; semiautomatic photo feature detection method; speech tagging; syntactic patterns; traditional statistical methods; user attitudes; user experience; user opinions; word importance modeling; Blogs; Dictionaries; Feature extraction; Frequency measurement; Internet; Motion pictures; Syntactics;
fLanguage
English
Publisher
IEEE
Conference_Titel
Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
Conference_Location
Wisla
ISSN
2157-5525
Print_ISBN
978-1-4244-6432-6
Type
conf
DOI
10.1109/IMCSIT.2010.5679726
Filename
5679726