Title of article :
Contingency matrix theory: Statistical dependence in a contingency table
Author/Authors :
Shusaku Tsumoto (author)
Issue Information :
Journal with serial issue number, year 2009
Abstract :
Chance discovery aims at understanding the meaning of functional dependency from the viewpoint of unexpected relations. One of the most important observations is that such a chance is hidden under a huge number of co-occurrences extracted from a given data set. On the other hand, conventional data-mining methods depend strongly on frequencies and statistics rather than on interestingness or unexpectedness. This paper discusses some limitations of the idea of statistical dependence, focusing especially on the formal characteristics of Simpson’s paradox from the viewpoint of linear algebra. Theoretical results show that Simpson’s paradox can be observed when a given contingency table, viewed as a matrix, is not regular; in other words, when the rank of the contingency matrix is not full. Thus, data-ordered evidence has some limitations, which should be compensated for by human-oriented reasoning.
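The rank condition mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's own procedure: the function names `matrix_rank` and `is_full_rank` and the example tables are hypothetical, chosen only for illustration. The sketch relies on the standard fact that a non-zero contingency table has rank 1 exactly when its row and column variables are statistically independent, and it computes the rank by exact Gaussian elimination over fractions.

```python
from fractions import Fraction


def matrix_rank(table):
    """Rank of a contingency table (list of rows of counts).

    Uses exact Gaussian elimination over Fractions, so no
    floating-point tolerance is needed.
    """
    rows = [[Fraction(x) for x in row] for row in table]
    n_cols = len(rows[0]) if rows else 0
    rank = 0
    for col in range(n_cols):
        # Find a row at or below position `rank` with a non-zero entry.
        pivot = next(
            (r for r in range(rank, len(rows)) if rows[r][col] != 0), None
        )
        if pivot is None:
            continue
        rows[rank], rows[pivot] = rows[pivot], rows[rank]
        # Eliminate this column from all rows below the pivot row.
        for r in range(rank + 1, len(rows)):
            factor = rows[r][col] / rows[rank][col]
            rows[r] = [a - factor * b for a, b in zip(rows[r], rows[rank])]
        rank += 1
    return rank


def is_full_rank(table):
    """True when the rank equals min(number of rows, number of columns)."""
    return matrix_rank(table) == min(len(table), len(table[0]))


# Hypothetical 2x2 tables (illustrative counts, not data from the paper).
independent = [[10, 20], [30, 60]]  # rows are proportional
dependent = [[10, 20], [30, 10]]    # rows are not proportional

# Hypothetical 3x3 table whose second row is twice the first,
# so the matrix is not regular (rank deficient).
degenerate = [[1, 2, 3], [2, 4, 6], [1, 0, 1]]

print(matrix_rank(independent), matrix_rank(dependent), matrix_rank(degenerate))
```

Under these assumptions, a table for which `is_full_rank` returns `False` is the kind of non-regular contingency matrix the abstract refers to. The sketch only detects rank deficiency and does not itself test for Simpson’s paradox.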
Keywords :
Contingency matrix , linear algebra , Simpson’s paradox , Statistical independence
Journal title :
Information Sciences