Title :
Sentence-Level Opinion-Topic Association for Opinion Detection in Blogs
Author :
Missen, Malik Muhammad Saad ; Boughanem, Mohand
Author_Institution :
IRIT, Univ. de Toulouse, Toulouse
Abstract :
The Opinion Detection from blogs has always been a challenge for researchers. One of the challenges faced is to find such documents that specifically contain opinion on users´ information need. This requires text processing on sentence level rather than on document level. In this paper, we have proposed an opinion detection approach. The proposed approach tries to tackle opinion detection problem by using some document level heuristics and processing documents on sentence level using different semantic similarity relations of WordNet between sentence words and list of weighted query terms expanded through encyclopedia Wikipedia. According to initial results, our approach performs well with MAP of 0.2177 with improvement of 28.89% over baseline results obtained through BM25 matching formula. TREC Blog 2006 data is used as test data collection.
Keywords :
Web sites; query processing; text analysis; blogs; document processing; encyclopedia Wikipedia; opinion detection approach; sentence-level opinion-topic association; text processing; weighted query term; Blogs; Dictionaries; Electronic mail; Encyclopedias; Face detection; Information retrieval; Machine learning; Testing; Text processing; Wikipedia; Information Retrieval; Opinion Detection; wordNet;
Conference_Titel :
Advanced Information Networking and Applications Workshops, 2009. WAINA '09. International Conference on
Conference_Location :
Bradford
Print_ISBN :
978-1-4244-3999-7
Electronic_ISBN :
978-0-7695-3639-2
DOI :
10.1109/WAINA.2009.157