Title of article :
An Entropy-Based Interpretation of Retrieval Status
Value-Based Retrieval, and Its Application to the
Computation of Term and Query Discrimination Value
Author/Authors :
Sa´ ndor Dominich، نويسنده , , Ju´ lia Go´ th، نويسنده , , Tama´ s Kiezer، نويسنده , , and Zolta´ n Szla´ vik، نويسنده ,
Issue Information :
ماهنامه با شماره پیاپی سال 2004
Abstract :
been applied to a number of information retrieval tasks
such as to formalize the probabilistic model, to design
practical retrieval systems, to cluster documents, and to
model texture in image retrieval. In this report, the concept
of entropy is used for a different purpose. It is
shown that any positive Retrieval Status Value (RSV)-
based retrieval system may be conceived as a special
probability space in which the amount of the associated
Shannon information is being reduced; in this view, the
retrieval system is referred to as Uncertainty Decreasing
Operation (UDO). The concept of UDO is then proposed
as a theoretical background for term and query discrimination
power, and it is applied to the computation of
term and query discrimination values in the vector space
retrieval model. Experimental evidence is given as regards
such computation; the results obtained compare
well to those obtained using vector-based calculation of
term discrimination values. The UDO-based computation,
however, presents advantages over the vectorbased
calculation: It is faster, easier to assess and handle
in practice, and its application is not restricted to the
vector space model. Based on the ADI test collection, it
is shown that the UDO-based Term Discrimination Value
(TDV) weighting scheme yields better retrieval effectiveness
than using the vector-based TDV weighting
scheme. Also, experimental evidence is given to the
intuition that the choice of an appropriate weighting
scheme and similarity measure depends on collection
properties, and thus the UDO approach may be used as
a theoretical basis for this intuition.
Journal title :
Journal of the American Society for Information Science and Technology
Journal title :
Journal of the American Society for Information Science and Technology