DocumentCode
243497
Title
Latent Factor SVM for Text Categorization
Author
Xiaofei Zhou ; Li Guo ; Ping Liu ; Yanbing Liu
Author_Institution
Inst. of Inf. Eng., Beijing, China
fYear
2014
fDate
14-14 Dec. 2014
Firstpage
105
Lastpage
110
Abstract
Text categorization is an important research in nature language process and content analysis. In this paper, we present latent factor SVM (LF-SVM) for text categorization which use latent factor vectors for category representation on text categorization. We prove that latent factors extracted by PLSA (probability latent semantic analysis) can span convex structure to express text category. Based on the category expression we adopt maximal margin hyper plane to divide the categories. The experiments on normal text datasets show that our motivation and algorithm are reasonable and effective.
Keywords
natural language interfaces; statistical analysis; support vector machines; text analysis; LF-SVM; category representation; content analysis; convex structure; latent factor SVM; latent factor vectors; maximal margin hyper plane; nature language process; probability latent semantic analysis; text categorization; text datasets; Feature extraction; Semantics; Support vector machine classification; Text categorization; Training; Vectors; PLSA; SVM; Text categorization; latent semantic; text classification;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location
Shenzhen
Print_ISBN
978-1-4799-4275-6
Type
conf
DOI
10.1109/ICDMW.2014.9
Filename
7022586
Link To Document