DocumentCode :
3425831
Title :
Hierarchically classifying documents with multiple labels
Author :
Mayne, Andrew ; Perry, Russell
Author_Institution :
Oxford Univ., Oxford
fYear :
2009
fDate :
March 30 2009-April 2 2009
Firstpage :
133
Lastpage :
139
Abstract :
This paper describes the evaluation of a hierarchical classifier for classifying multi-labeled documents organized in a two-level taxonomy. The hierarchical classifier consists of a tree of independent naive Bayes classifiers, with output probabilities from parent classifiers propagated to child classifiers as additional features. Each classifier uses Bi-Normal Feature Separation for word feature selection. Experiments were performed using the Weka Toolkit adapted to deal with multi-labeled documents. The hierarchical classifier accuracy marginally out-performed a set of independent binary classifiers trained to classify documents for each class in the taxonomy.
Keywords :
Bayes methods; document handling; feature extraction; pattern classification; probability; trees (mathematics); bi-normal feature separation; hierarchical document classification; independent naive Bayes classifier tree; multi labeled document; output probability; word feature selection; Classification tree analysis; Feature extraction; Feedback; Feeds; Information retrieval; Information technology; Search engines; Taxonomy; User interfaces; Wikipedia;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Data Mining, 2009. CIDM '09. IEEE Symposium on
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-2765-9
Type :
conf
DOI :
10.1109/CIDM.2009.4938640
Filename :
4938640
Link To Document :
بازگشت