DocumentCode
3626872
Title
A new evaluation measure for information retrieval systems
Author
Martin Mehlitz;Christian Bauckhage;Jerome Kunegis;Sahin Albayrak
Author_Institution
Technical University Berlin, DAI-Labor, 10587, Germany
fYear
2007
Firstpage
1200
Lastpage
1204
Abstract
Some of the established approaches to evaluating text clustering algorithms for information retrieval show theoretical flaws. In this paper, we analyze these flaws and introduce a new evaluation measure to overcome them. Based on a simple yet rigorous mathematical analysis of the effect of certain parameters in cluster based retrieval, we show that certain conclusions drawn in the recent literature must be taken with a grain of salt. Our new measure, in contrast, accounts for statistical biases that have to be expected according to our analysis. A series of experiments and a comparison with results reported recently underlines that this measure is a more suitable performance indicator that allows for more meaningful interpretations.
Keywords
"Information retrieval","Clustering algorithms","Mathematical analysis","Laboratories","Computer science","Storage automation","Mathematical model","Information analysis","Search engines"
Publisher
ieee
Conference_Titel
Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
ISSN
1062-922X
Print_ISBN
978-1-4244-0990-7
Type
conf
DOI
10.1109/ICSMC.2007.4413662
Filename
4413662
Link To Document