DocumentCode
2892893
Title
Using Multiple Sets of Attributes for Text Categorization
Author
Bi, Ya-xin ; Zhang, Qiang ; Wu, Sheng-li ; Guan, Ji-wen
Author_Institution
Sch. of Comput. & Math., Ulster Univ., Newtownabbey
fYear
2006
fDate
13-16 Aug. 2006
Firstpage
2252
Lastpage
2256
Abstract
This paper investigates how multiple sets of attributes can be generated using a rough sets-based inductive learning method and how they can be combined for improving classification decisions, particularly in the context of text categorization, by using Dempster\´s rule of combination. We first propose a boosting-like technique for generating multiple sets of attributes based on rough set theory, and a method for transforming multiple sets of attributes to multiple sets of rules, and then model classification decisions inferred by the rules as pieces of evidence. The various experiments have been carried out on 10 out of the 20-newsgroups - a benchmark data collection ndividually and in combination. Our experimental results support the claim that "decisions made by multiple experts would be more effective than any one if their individual judgments are appropriately combined"
Keywords
inference mechanisms; learning (artificial intelligence); pattern classification; rough set theory; text analysis; Dempster combination rule; ensemble method; inductive learning; information fusion; multiple attribute set; pattern classification; rough set theory; text categorization; Bismuth; Boosting; Computer science; Cybernetics; Educational institutions; Electronic mail; Learning systems; Machine learning; Mathematics; Pattern recognition; Set theory; Text categorization; Inductive learning; ensemble methods; information fusion; text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2006 International Conference on
Conference_Location
Dalian, China
Print_ISBN
1-4244-0061-9
Type
conf
DOI
10.1109/ICMLC.2006.258668
Filename
4028439
Link To Document