Title :
A comparative evaluation of term weighting methods for information filtering
Author :
Nanas, Nikolaos ; Uren, Victoria ; De Roeck, Anne
Author_Institution :
Knowledge Media Inst., Open Univ., Milton Keynes, UK
Date :
30 Aug.-3 Sept. 2004
Abstract :
Users of information filtering systems cannot be expected to provide large amounts of information to initialize a profile. Therefore, term weighting methods for information filtering have somewhat different requirements from those for information retrieval and text categorization. We present a comparative evaluation of term weighting methods, including a new method, relative document frequency, designed specifically for information filtering. The best weighting methods appear to be those that favor information provided by the user over information from a general collection.
Keywords :
information filtering; relevance feedback; statistical analysis; text analysis; information filtering systems; information retrieval; term weighting methods; text categorization; Data mining; Feedback; Frequency; Information filtering; Information filters; Information retrieval; Machine assisted indexing; Routing; Statistical distributions; Text categorization
Conference_Title :
Database and Expert Systems Applications, 2004. Proceedings. 15th International Workshop on
Print_ISBN :
0-7695-2195-9
DOI :
10.1109/DEXA.2004.1333442