Title :
Evaluation of criteria for information retrieval
Author :
ISHIOKA, Tsunenori
Author_Institution :
The Nat. Center for Univ. Entrance Examinations, Japan
Abstract :
We investigate van Rijsbergen's F-measure, the break-even point, and the 11-point averaged precision, each of which reduces precision and recall to a single scalar quantity. We evaluate these measures by comparing them with the tetrachoric (four-fold) correlation coefficient and the phi (four-fold point) coefficient, which are often used as indices of statistical association in a 2×2 contingency table. The results show that when the fallout rate is less than 0.1, (1) the F1 measure has properties similar to those of the phi coefficient, (2) the break-even point is almost equivalent to the phi coefficient, and (3) the 11-point averaged precision is larger than the phi coefficient and smaller than the tetrachoric correlation coefficient.
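To make the quantities named in the abstract concrete, the following is a minimal sketch, not taken from the paper, that computes precision, recall, fallout, the F1 measure, and the phi coefficient from the four cells of a 2×2 contingency table using their standard textbook definitions. The function name `ir_measures`, the cell names `tp`, `fp`, `fn`, `tn`, and the example counts are assumptions chosen for illustration. The tetrachoric correlation coefficient is omitted because it requires a numerical fit of a bivariate normal distribution.

```python
import math


def ir_measures(tp, fp, fn, tn):
    """Compute scalar retrieval measures from a 2x2 contingency table.

    Hypothetical cell names (not from the paper):
      tp: relevant and retrieved       fp: non-relevant but retrieved
      fn: relevant but not retrieved   tn: non-relevant and not retrieved
    """
    retrieved = tp + fp
    relevant = tp + fn
    non_relevant = fp + tn
    not_retrieved = fn + tn
    if 0 in (retrieved, relevant, non_relevant, not_retrieved):
        raise ValueError("every row and column total must be non-zero")
    if tp == 0:
        raise ValueError("F1 is undefined when no relevant item is retrieved")

    precision = tp / retrieved
    recall = tp / relevant
    fallout = fp / non_relevant

    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)

    # Phi coefficient: Pearson correlation of the two binary variables.
    phi = (tp * tn - fp * fn) / math.sqrt(
        retrieved * relevant * non_relevant * not_retrieved
    )

    return {
        "precision": precision,
        "recall": recall,
        "fallout": fallout,
        "f1": f1,
        "phi": phi,
    }


# Illustrative counts only (assumed, not data from the paper).
example = ir_measures(tp=40, fp=10, fn=20, tn=930)
for name, value in example.items():
    print(f"{name}: {value:.4f}")
```

With these illustrative counts the fallout rate is well below 0.1, and the F1 value and the phi coefficient come out close to each other, which matches the regime the abstract describes. This is a single example and not evidence for the paper's result.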
Keywords :
Internet; correlation methods; information retrieval system evaluation; relevance feedback; search engines; statistical analysis; break-even point; contingency table; fallout rate; information retrieval; phi coefficient; tetrachoric correlation coefficient; van Rijsbergen F-measure; Machinery
Conference_Title :
Web Intelligence, 2003. WI 2003. Proceedings. IEEE/WIC International Conference on
Print_ISBN :
0-7695-1932-6
DOI :
10.1109/WI.2003.1241232