Title :
Two-Level Automatic Classification applied to Bulky Data Bases
Author :
Cherif, Dorsaf ; Ben Ahmed, Mohamed
Author_Institution :
Ecole Nationale des Sci. de l´´Informatique, La Manouba Univ.
Abstract :
Classifying the data of a bulky base on the basis of high number of attributes is not an easy task because of the scarcity of the adequate methods present in the literature. These methods generally resort to the reduction of the number of data using sampling techniques or the analysis in principal components (APC). Problems are often encountered, namely the complexity of calculation, the slowness of execution and the relevance of the results. We developed for this purpose, an approach of two-level automatic classification allowing to transform a bulky base into an exploitable group of classes for the extraction of knowledge and decision-making. The robustness, the precision and the optimality of our approach are shown through its comparison with the traditional approach of classification (classification of the original data base), and this, through the results produced following the application of two approaches to a bulky data base. These results include both the clusters and the knowledge map formed by association rules generated on the original base on the one hand, and the summary of BIRCH on the other hand
Keywords :
computational complexity; data mining; data reduction; database management systems; decision making; decision trees; pattern classification; principal component analysis; sampling methods; self-organising feature maps; BIRCH; KMEANS; Kohonen maps; analysis in principal components; association rules generation; automatic classification; bulky data bases; data classification; data reduction; decision trees; decision-making; knowledge extraction; knowledge map; sampling techniques; Association rules; Classification algorithms; Clustering algorithms; Data mining; Decision making; Iterative algorithms; Laboratories; Principal component analysis; Robustness; Sampling methods; Automatic Classification; BIRCH Algorithm; KOHONEN maps; association rules; decision trees; method KMEANS;
Conference_Titel :
Information and Communication Technologies, 2006. ICTTA '06. 2nd
Conference_Location :
Damascus
Print_ISBN :
0-7803-9521-2
DOI :
10.1109/ICTTA.2006.1684993