• DocumentCode
    389294
  • Title

    Weak learning algorithm for multi-label multiclass text categorization

  • Author

    Xu, Yan-yong ; Zhou, Xian-Zhong ; Guo, Zhong-wei

  • Author_Institution
    Dept. of Autom., Nanjing Univ. of Sci. & Technol., China
  • Volume
    2
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    890
  • Abstract
    To handle the multi-label multiclass text categorization, a weak learning algorithm (WLA) is presented. The main idea of WLA is to find a highly accurate classification rule by combining many weak hypotheses, each of which may be only moderately accurate. We used a separate procedure, called the weak learner, to compute the weak hypotheses, and found a set of weak hypotheses by calling the weak learner repeatedly in a series of rounds. These weak hypotheses were then combined into a single rule called the final hypothesis, and the final hypothesis ranked the possible labels for a given document with the hope that the appropriate labels would appear at the top of the ranking. Using the three designed evaluation measures - ordinary-error, average-coverage and average-precision - our experiments show that the performance of WLA is generally better than the other algorithms on the same dataset.
  • Keywords
    category theory; classification; learning (artificial intelligence); text analysis; classification rule; final hypothesis; machine-learning; multiple label multiclass; text categorization; weak hypotheses; weak learning algorithm; Algorithm design and analysis; Automation; Decision trees; Filtering; Internet; Large-scale systems; Machine learning; Neural networks; Space technology; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
  • Print_ISBN
    0-7803-7508-4
  • Type

    conf

  • DOI
    10.1109/ICMLC.2002.1174511
  • Filename
    1174511