Title :
Improved Multi Label Classification in Hierarchical Taxonomies
Author :
Punera, Kunal ; Rajan, Suju
Abstract :
Hierarchical taxonomies are used to organize and retrieve information in many domains, especially those dealing with large and rapidly growing amounts of information. In many of these domains data also tends to be multi-label in nature. In this paper, we consider the problem of automated text classification in these scenarios. We present a post-processing based approach that performs smoothing on the output of an underlying one-vs-all ensemble. In order to do this we formulate a Regularized Unimodal Regression problem and give an exact algorithm to solve it. We evaluate the performance of our approach on several real-world large-scale multi-label hierarchical taxonomies and demonstrate that our proposed method provides significant gains over other related approaches.
Keywords :
classification; information retrieval; regression analysis; text analysis; automated text classification; hierarchical taxonomies; information retrieval; multi label classification; regularized unimodal regression; Availability; Conferences; Data mining; Information retrieval; Large-scale systems; Navigation; Performance gain; Smoothing methods; Taxonomy; Text categorization;
Conference_Titel :
Data Mining Workshops, 2009. ICDMW '09. IEEE International Conference on
Conference_Location :
Miami, FL
Print_ISBN :
978-1-4244-5384-9
Electronic_ISBN :
978-0-7695-3902-7
DOI :
10.1109/ICDMW.2009.110