Title :
Text categorization method based on extension theory
Author :
Yi, Yong ; Zheng, Yan ; He, Zhongshi ; Wu, Zhongfu
Author_Institution :
Comput. Sci. Inst., Chongqing Univ., China
Abstract :
We introduce a new text categorization method utilizing machine learning based on extension theory. This dependent degree based on the extension theory represents the extent to which the element belongs to the predefined categories. The "closeness degree" between the input document vector and standard range of each predefined category can be calculated. The new method is conceptually simple; it can be used with relatively low complexity and high flexibility: The algorithm is highly scalable. It can be effectively applied to text categorization, of which various features are consecutive values. Furthermore, this algorithm can be widely applied to computational linguistics.
Keywords :
classification; computational complexity; computational linguistics; learning (artificial intelligence); text analysis; computational linguistics; document vector; extension theory; machine learning; text categorization method; text classification algorithm; Classification algorithms; Classification tree analysis; Computational linguistics; Computer science; Decision trees; Helium; Nearest neighbor searches; Regression tree analysis; Text categorization; Training data;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
DOI :
10.1109/NLPKE.2003.1275986