• DocumentCode
    3106359
  • Title

    A Balanced Ensemble Approach to Weighting Classifiers for Text Classification

  • Author

    Fung, Gabriel Pui Cheong ; Yu, Jeffrey Xu ; Wang, Haixun ; Cheung, David W. ; Liu, Huan

  • Author_Institution
    Chinese Univ. of Hong Kong, Kowloon
  • fYear
    2006
  • fDate
    18-22 Dec. 2006
  • Firstpage
    869
  • Lastpage
    873
  • Abstract
    This paper studies the problem of constructing an effective heterogeneous ensemble classifier for text classification. One major challenge of this problem is to formulate a good combination function, which combines the decisions of the individual classifiers in the ensemble. We show that the classification performance is affected by three weight components and they should be included in deriving an effective combination function. They are: (1) Global effectiveness, which measures the effectiveness of a member classifier in classifying a set of unseen documents; (2) Local effectiveness, which measures the effectiveness of a member classifier in classifying the particular domain of an unseen document; and (3) Decision confidence, which describes how confident a classifier is when making a decision when classifying a specific unseen document. We propose a new balanced combination function, called dynamic classifier weighting (DCW), that incorporates the aforementioned three components. The empirical study demonstrates that the new combination function is highly effective for text classification.
  • Keywords
    classification; text analysis; balanced combination function; balanced ensemble approach; dynamic classifier weighting; heterogeneous ensemble classifier; specific unseen document; text classification; weighting classifiers; Bagging; Boosting; Labeling; Particle measurements; Robustness; Stacking; Support vector machine classification; Support vector machines; Text categorization; Voting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining, 2006. ICDM '06. Sixth International Conference on
  • Conference_Location
    Hong Kong
  • ISSN
    1550-4786
  • Print_ISBN
    0-7695-2701-7
  • Type

    conf

  • DOI
    10.1109/ICDM.2006.2
  • Filename
    4053118