• DocumentCode
    3626872
  • Title

    A new evaluation measure for information retrieval systems

  • Author

    Martin Mehlitz;Christian Bauckhage;Jerome Kunegis;Sahin Albayrak

  • Author_Institution
    Technical University Berlin, DAI-Labor, 10587, Germany
  • fYear
    2007
  • Firstpage
    1200
  • Lastpage
    1204
  • Abstract
    Some of the established approaches to evaluating text clustering algorithms for information retrieval show theoretical flaws. In this paper, we analyze these flaws and introduce a new evaluation measure to overcome them. Based on a simple yet rigorous mathematical analysis of the effect of certain parameters in cluster based retrieval, we show that certain conclusions drawn in the recent literature must be taken with a grain of salt. Our new measure, in contrast, accounts for statistical biases that have to be expected according to our analysis. A series of experiments and a comparison with results reported recently underlines that this measure is a more suitable performance indicator that allows for more meaningful interpretations.
  • Keywords
    "Information retrieval","Clustering algorithms","Mathematical analysis","Laboratories","Computer science","Storage automation","Mathematical model","Information analysis","Search engines"
  • Publisher
    ieee
  • Conference_Titel
    Systems, Man and Cybernetics, 2007. ISIC. IEEE International Conference on
  • ISSN
    1062-922X
  • Print_ISBN
    978-1-4244-0990-7
  • Type

    conf

  • DOI
    10.1109/ICSMC.2007.4413662
  • Filename
    4413662