Title :
Heuristic Algorithm for Interpretation of Non-Atomic Categorical Attributes in Similarity-based Fuzzy Databases Scalability Evaluation
Author :
Hossain, M. Shahriar ; Angryk, Rafal A.
Author_Institution :
Montana State Univ. Bozeman, Bozeman
Abstract :
In this work we are analyzing scalability of the heuristic algorithm we used in the past [1-4] to discover knowledge from multi-valued symbolic attributes in fuzzy databases. The non-atomic descriptors, characterizing a single attribute of a database record, are commonly used in fuzzy databases to reflect uncertainty about the recorded observation. In this paper, we present implementation details and scalability tests of the algorithm, which we developed to precisely interpret such non-atomic values and to transfer (i.e. de fuzzify) the fuzzy tuples to the forms acceptable for many regular (i.e. atomic values based) data mining algorithms. Important advantages of our approach are: (1) its linear scalability, and (2) its unique capability of incorporating background knowledge, implicitly stored in the fuzzy database models in the form of fuzzy similarity hierarchy, into the interpretation/defuzzification process.
Keywords :
data analysis; data mining; data models; fuzzy set theory; heuristic programming; relational databases; uncertainty handling; data anlaysis; data mining algorithms; defuzzification process; fuzzy similarity hierarchy; fuzzy tuples; heuristic algorithm; interpretation process; multivalued symbolic attributes; nonatomic categorical attributes; scalability evaluation; similarity-based fuzzy relational database models; Algorithm design and analysis; Computer science; Data analysis; Data mining; Fuzzy sets; Heuristic algorithms; Relational databases; Scalability; Spatial databases; Uncertainty;
Conference_Titel :
Fuzzy Information Processing Society, 2007. NAFIPS '07. Annual Meeting of the North American
Conference_Location :
San Diego, CA
Print_ISBN :
1-4244-1213-7
Electronic_ISBN :
1-4244-1214-5
DOI :
10.1109/NAFIPS.2007.383843