• DocumentCode
    442056
  • Title

    Automatic text categorization based on angle distribution

  • Author

    Liu, Tao ; Guo, Jun

  • Author_Institution
    Sch. of Inf. Eng., Beijing Univ. of Posts & Telecommun., China
  • Volume
    6
  • fYear
    2005
  • fDate
    18-21 Aug. 2005
  • Firstpage
    3797
  • Abstract
    In order to improve the performance of Chinese text categorization, a new Chinese text categorization method based on angle distribution is presented. The new method describes the text with a more precise model and proposed a new categorization algorithm by employing angle distribution. Simulation results on open Chinese text collection show that the precision and recall of most classes have been increased with reference to the classical method, and the macro average of precision and recall are both about 72 percents, which certificating the effectiveness and feasibility of the angle distribution-based algorithm.
  • Keywords
    indexing; information retrieval; text analysis; angle distribution; automatic Chinese text categorization method; text similarity; Classification tree analysis; Content based retrieval; Distributed computing; Information retrieval; Machine learning; Machine learning algorithms; Nearest neighbor searches; Regression tree analysis; Text categorization; Web sites; Text categorization; angle distribution; similarity;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
  • Conference_Location
    Guangzhou, China
  • Print_ISBN
    0-7803-9091-1
  • Type

    conf

  • DOI
    10.1109/ICMLC.2005.1527601
  • Filename
    1527601