Title :
Mining associations by linear inequalities
Author_Institution :
Dept. of Comput. Sci., San Jose State Univ., CA, USA
Abstract :
The main theorem is: generalized associations of a relational table can be found by a finite set of linear inequalities within polynomial time. It is derived from the following three results, which were established in ICDMO´02 and are re-developed here. They are: (1) isomorphic theorem: isomorphic relations have isomorphic patterns. Such an isomorphism classifies relational tables into isomorphic classes. (2) A variant of the classical bitmaps indexes uniquely exists in each isomorphic class. We take it as the canonical model of the class. (3) All possible attributes/features can be generated by a generalized procedure of the classical AOG (attribute oriented generalization). Then, (4) the main theorem for canonical model is established. By isomorphism theorem, we had the final result (5).
Keywords :
data mining; pattern classification; set theory; theorem proving; association mining; attribute oriented generalization; bitmaps indexes; canonical model; finite set; generalized associations; isomorphic classes; isomorphic patterns; isomorphic relations; isomorphic theorem; linear inequalities; polynomial time; relational table; Anatomy; Association rules; Computer science; Data mining; Data processing; Frequency; Humans; Logic functions; Polynomials; Software systems; association; bitmaps; deduction; feature; granules;
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
DOI :
10.1109/ICDM.2004.10098