DocumentCode
588706
Title
Why Web-Based Pseudo Relevance Feedback Systems Fail
Author
Jing Zhang; Kok-leong Ong; Victor C. S. Lee
Author_Institution
Sch. of Inf. Technol., Deakin Univ., Burwood, VIC, Australia
fYear
2012
fDate
8-10 Nov. 2012
Firstpage
216
Lastpage
222
Abstract
We review pseudo-relevance feedback as a mechanism for expanding short texts. Where short texts exhibit evolving concepts, topics and other characteristics, Web-based feedback systems have been touted as the ideal way of enriching the feature space of short texts. However, we note from a recent implementation of a Web-based pseudo-relevance feedback system that it performs well only under clinical conditions. Further refinements to address fundamental noise in Web documents did not yield significant improvements, leading us to conclude that relevance feedback using Web documents directly is unsuitable for real-world conditions. In this paper, we present Eddi, a recent system that serves as an exemplar of a typical pseudo-relevance feedback system. We first show the conditions under which Eddi works and then discuss the situations in which it fails. We then present the variations to Eddi from our attempt to improve the robustness of Eddi's algorithm when dealing with complex Web documents. Finally, we present the results from all variations to show the lack of robustness of pseudo-relevance feedback with Web documents.
Keywords
Web services; relevance feedback; text analysis; Eddi; Web-based pseudo relevance feedback; Web document; fundamental noise; Blogs; Educational institutions; HTML; Internet; Noise; Noise measurement; Search engines; pseudo-relevance feedback; topic detection; Twitter
fLanguage
English
Publisher
IEEE
Conference_Titel
Knowledge, Information and Creativity Support Systems (KICSS), 2012 Seventh International Conference on
Conference_Location
Melbourne, VIC
Print_ISBN
978-1-4673-4564-4
Type
conf
DOI
10.1109/KICSS.2012.40
Filename
6405532