Title :
Chinese Text Classification Based on Summarization Technique
Author :
Jiang, Xiao-Yu ; Fan, Xiao-Zhong ; Chen, Kang
Author_Institution :
Beijing Inst. of Technol., Beijing
Abstract :
Two approaches to text classification based on summarization techniques are proposed. In the first approach, the heuristic rules of automatic summarization are used to select and weight features for each category, and texts are classified using these features. In the second approach, the summary of a text is classified directly in place of the original text. Experimental results show that combining summarization with classification not only reduces the time needed for feature selection and classification but also improves classification performance.
Keywords :
abstracting; classification; feature extraction; natural language processing; text analysis; Chinese text classification; auto-summarization; feature selection; heuristic rules; text summarization; Clothing; Computer science; Data mining; Feature extraction; Robustness; Statistical analysis; Testing; Text categorization; Web pages
Conference_Title :
Semantics, Knowledge and Grid, Third International Conference on
Conference_Location :
Shan Xi
Print_ISBN :
0-7695-3007-9
Electronic_ISBN :
978-0-7695-3007-9
DOI :
10.1109/SKG.2007.68