• DocumentCode
    2892893
  • Title

    Using Multiple Sets of Attributes for Text Categorization

  • Author

    Bi, Ya-xin ; Zhang, Qiang ; Wu, Sheng-li ; Guan, Ji-wen

  • Author_Institution
    Sch. of Comput. & Math., Ulster Univ., Newtownabbey
  • fYear
    2006
  • fDate
    13-16 Aug. 2006
  • Firstpage
    2252
  • Lastpage
    2256
  • Abstract
    This paper investigates how multiple sets of attributes can be generated using a rough sets-based inductive learning method and how they can be combined for improving classification decisions, particularly in the context of text categorization, by using Dempster\´s rule of combination. We first propose a boosting-like technique for generating multiple sets of attributes based on rough set theory, and a method for transforming multiple sets of attributes to multiple sets of rules, and then model classification decisions inferred by the rules as pieces of evidence. The various experiments have been carried out on 10 out of the 20-newsgroups - a benchmark data collection ndividually and in combination. Our experimental results support the claim that "decisions made by multiple experts would be more effective than any one if their individual judgments are appropriately combined"
  • Keywords
    inference mechanisms; learning (artificial intelligence); pattern classification; rough set theory; text analysis; Dempster combination rule; ensemble method; inductive learning; information fusion; multiple attribute set; pattern classification; rough set theory; text categorization; Bismuth; Boosting; Computer science; Cybernetics; Educational institutions; Electronic mail; Learning systems; Machine learning; Mathematics; Pattern recognition; Set theory; Text categorization; Inductive learning; ensemble methods; information fusion; text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2006 International Conference on
  • Conference_Location
    Dalian, China
  • Print_ISBN
    1-4244-0061-9
  • Type

    conf

  • DOI
    10.1109/ICMLC.2006.258668
  • Filename
    4028439