• DocumentCode
    1958211
  • Title

    A proposal for a model for dealing with value-based data dependencies to improve the rule discovery process

  • Author

    Giuffrida, Giovanni ; Cutello, Vincenzo

  • Author_Institution
    Dept. of Comput. Sci., California Univ., Los Angeles, CA, USA
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1025
  • Abstract
    The discovery of conjunctive “if-then” classification rules may be intractable when enumerating all possible conjunctions of terms. Various algorithms, notably C4.5 and CART, adopt a univariate strategy which reduces the process to a one-at-a-time best variable type of approach. While computationally feasible, such an approach may lead to unexplored portions of the database which may contain valuable nuggets. On the other hand, an exhaustive evaluation of all possible conjunctions may be intractable even for relatively small datasets. We propose a general approach to reduce the size of the search space of conjunctive “if-then” rule discovery algorithms by exploiting value-based data dependencies existing among the independent variables
  • Keywords
    data mining; fuzzy set theory; conjunctive if-then classification rules; independent variables; one-at-a-time best variable approach; rule discovery process; value-based data dependencies; Back; Cities and towns; Computer science; Data mining; Entropy; Pregnancy; Proposals; Redundancy; Relational databases; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems, 2000. FUZZ IEEE 2000. The Ninth IEEE International Conference on
  • Conference_Location
    San Antonio, TX
  • ISSN
    1098-7584
  • Print_ISBN
    0-7803-5877-5
  • Type

    conf

  • DOI
    10.1109/FUZZY.2000.839190
  • Filename
    839190