DocumentCode :
3145490
Title :
Anti-serendipity: finding useless documents and similar documents
Author :
Cooper, James W. ; Prager, John M.
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2000
fDate :
4-7 Jan. 2000
Abstract :
The problem of finding your way through a relatively unknown collection of digital documents can be daunting. Such collections sometimes have few categories and little hierarchy, or they have so much hierarchy that valuable relations between documents can easily become obscured. We describe here how our work in the area of term-recognition and sentence-based summarization can be used to filter the document lists that we return from searches. We can thus remove or downgrade the ranking of some documents that have limited utility even though they may match many of the search terms fairly accurately. We also describe how we can use this same system to find documents that are closely related to a document of interest, thus continuing our work to provide tools for query-free searching.
Keywords :
information retrieval; anti-serendipity; query-free searching; sentence-based summarization; similar documents; term-recognition; useless documents; Computer interfaces; Displays; Feedback; Filters; Performance analysis; Search engines; Statistics; Text analysis; Thesauri;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 2000. Proceedings of the 33rd Annual Hawaii International Conference on
Print_ISBN :
0-7695-0493-0
Type :
conf
DOI :
10.1109/HICSS.2000.926691
Filename :
926691
Link To Document :
بازگشت