Title :
A new text classification model based on the sentence space
Author :
Zhu, Tie-Dan ; Zhao, Xin-Xin ; Liu, Yu-shu
Author_Institution :
Sch. of Inf. Sci. & Technol., Beijing Inst. of Technol., China
Abstract :
This paper proposes a sentence space model, which expresses a text by sentence units and keeps the structure of the original text. It accomplishes the TC task by the sentence-class contribution and the double-voting method. Comparing with the VSM, this model has a higher classification accuracy in many data sets.
Keywords :
Bayes methods; text analysis; Naive Bayes; double-voting; sentence space model; sentence-class contribution; text classification model; vector space model; Computer science; Electronic mail; Information science; Labeling; Machine learning; Space technology; Support vector machine classification; Support vector machines; Text categorization; Web mining; Naive Bayes; Text Classification; Vector Space Model;
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1527232