Title :
Effective XML Classification Using Content and Structural Information via Rule Learning
Author :
Costa, Gianni ; Ortale, Riccardo ; Ritacco, Ettore
Author_Institution :
ICAR-CNR, Naples, Italy
Abstract :
We propose a new approach to XML classification, that uses a particular rule-learning technique for the induction of interpretable classification models. These separate the individual classes of XML documents by looking at the presence within the XML documents themselves of certain features, that provide information on their content and structure. The devised approach induces classifiers with outperforming effectiveness in comparison to several established competitors.
Keywords :
XML; document handling; knowledge based systems; learning (artificial intelligence); pattern classification; XML classification; XML document; interpretable classification model induction; rule learning; structural information; Context; Data models; Databases; Predictive models; Training data; Vegetation; XML;
Conference_Titel :
Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4577-2068-0
Electronic_ISBN :
1082-3409
DOI :
10.1109/ICTAI.2011.24