DocumentCode :
2487032
Title :
Research of Text Classification Model Based on Latent Semantic Analysis and Improved HS-SVM
Author :
Zhang, Yu-feng ; He, Chao
Author_Institution :
Center for Studies of Inf. Resources, Wuhan Univ., Wuhan, China
fYear :
2010
fDate :
22-23 May 2010
Firstpage :
1
Lastpage :
5
Abstract :
A text classification model based on Latent Semantic Analysis and Improved Hyper-sphere Support Vector Machine, is proposed in order to improve the accuracy and efficiency of text classification. Latent Semantic Analysis is used in this model for feature extraction, eliminating the text representation errors caused by synonyms and polysemes, and reducing the dimension of text vector. At the same time, a new decision-making approach based on concentration, is designed for improving the classification of texts in overlapping regions. Experimental results show that the model will give good classification results when the number of classes is small. And the improved algorithm is effective and feasible.
Keywords :
decision making; feature extraction; support vector machines; text analysis; vectors; decision making; feature extraction; improved HS-SVM; improved hypersphere support vector machine; latent semantic analysis; text classification model; text representation error elimination; text vector dimension; Feature extraction; Information analysis; Information filtering; Information resources; Machine learning; Machine learning algorithms; Quadratic programming; Support vector machine classification; Support vector machines; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems and Applications (ISA), 2010 2nd International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-5872-1
Electronic_ISBN :
978-1-4244-5874-5
Type :
conf
DOI :
10.1109/IWISA.2010.5473702
Filename :
5473702
Link To Document :
بازگشت