DocumentCode :
3299033
Title :
Rapid and robust ranking of text documents in a dynamically changing corpus
Author :
Park, Byung-Hoon ; Samatova, Nagiza F. ; Munavalli, Rajesh ; Krishnamurthy, Ramya ; Kettani, Houssain ; Geist, Al
Author_Institution :
Oak Ridge Nat. Lab., Oak Ridge
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
149
Lastpage :
155
Abstract :
Ranking documents in a selected corpus plays an important role in information retrieval systems. Despite notable advances in this direction, with continuously accumulating text documents, maintaining up-to-date ordering among documents in the domains of interest is a challenging task. Conventional approaches can produce an ordering that is only valid within a given corpus. Thus, with such approaches, ordering should be completely redone as documents are added to or deleted from the corpus. In this paper, we introduce a corpus- independent framework for rapid ordering of documents in a dynamically changing corpus. Like in many practical approaches, our framework suggests utilizing a similarity measure in some metric space indicating the degree of relevance of a document to the domain of interest. However, unlike in corpus- dependent approaches, the relevance score of a document remains valid with changes being introduced into the corpus (insertion of new documents, for example), thus allowing a rapid ordering within the corpus. This paper particularly details a statistical approach to compute such relevance scores.
Keywords :
information retrieval systems; corpus- independent framework; documents rapid ordering; dynamically changing corpus; information retrieval systems; text documents ranking; Computer science; Extraterrestrial measurements; Frequency; Information retrieval; Robustness; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications, 2008. AICCSA 2008. IEEE/ACS International Conference on
Conference_Location :
Doha
Print_ISBN :
978-1-4244-1967-8
Electronic_ISBN :
978-1-4244-1968-5
Type :
conf
DOI :
10.1109/AICCSA.2008.4493529
Filename :
4493529
Link To Document :
بازگشت