• DocumentCode
    2724067
  • Title

    Validity of Probabilistic Rules

  • Author

    Sapir, Marina ; Teverovskiy, Mikhail

  • Author_Institution
    Aureon Labs., Yonkers, NY
  • fYear
    2007
  • fDate
    March 1 2007-April 5 2007
  • Firstpage
    6
  • Lastpage
    9
  • Abstract
    We propose an axiomatic approach to defining of the validity of probabilistic inductive rules E rArr H. The set of rules is evaluated against an available dataset, where the conditions E, H are either true or false for each instance in the dataset. Introduced here are six axioms which formalize common sense dependencies between the validity of rules and their support, confidence, lift and amount of available evidence. Having a single validity measure, contrary to multiple criteria, helps compare and rank induced rules. We demonstrate that the z-test of difference of proportions satisfies all the axioms and can be used as a measure of rules validity. Knowing that the z-test statistics is normally distributed, allows one to filter out statistically unreliable rules. We demonstrate advantages of the proposed approach on a real life medical dataset
  • Keywords
    data mining; probability; statistical testing; common sense dependencies; medical dataset; probabilistic inductive rules; validity measure; z-test statistics; Association rules; Computational intelligence; Data analysis; Data mining; Filters; Production; Statistical distributions; USA Councils;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Data Mining, 2007. CIDM 2007. IEEE Symposium on
  • Conference_Location
    Honolulu, HI
  • Print_ISBN
    1-4244-0705-2
  • Type

    conf

  • DOI
    10.1109/CIDM.2007.368845
  • Filename
    4221269