Title :
Automatic Construction of N-ary Tree Based Taxonomies
Author :
Punera, Kunal ; Rajan, Suju ; Ghosh, Joydeep
Author_Institution :
Dept. of Electr. & Comput. Eng., Texas Univ., Austin, CA
Abstract :
Hierarchies are an intuitive and effective organization paradigm for data. Of late there has been considerable research on automatically learning hierarchical organizations of data. In this paper, we explore the problem of learning n-ary tree based hierarchies of categories with no user-defined parameters. We propose a framework that characterizes a "good" taxonomy and also provide an algorithm to find it. This algorithm works completely automatically (with no user input) and is significantly less greedy than existing algorithms in literature. We evaluate our approach on multiple real life datasets from diverse domains, such as text mining, hyper-spectral analysis, written character recognition etc. Our experimental results show that not only are n-ary trees based taxonomies more "natural", but also the output space decompositions induced by these taxonomies for many datasets yield better classification accuracies as opposed to classification on binary tree based taxonomies
Keywords :
learning (artificial intelligence); trees (mathematics); automatic construction; automatic learning; binary tree; data recognition; hierarchical organization; n-ary tree; organization paradigm; taxonomies; user-defined parameters; Binary trees; Classification tree analysis; Costs; Handwriting recognition; Labeling; Libraries; Navigation; Taxonomy; Text mining; Web sites;
Conference_Titel :
Data Mining Workshops, 2006. ICDM Workshops 2006. Sixth IEEE International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2702-7
DOI :
10.1109/ICDMW.2006.35