DocumentCode
2861985
Title
Class Selection Based Iterative Supervised Latent Semantic Indexing for Text Categorization
Author
Wang, Ming-Bo ; Liu, Cheng-Lin
Author_Institution
Nat. Lab. of Pattern Recognition (NLPR), Chinese Acad. of Sci., Beijing, China
fYear
2009
fDate
19-20 Dec. 2009
Firstpage
1
Lastpage
4
Abstract
Latent semantic indexing (LSI) is an effective technique for feature extraction in text mining, and supervised LSI (SLSI) algorithms have been proposed to exploit the class labels of training data. In this paper, we propose an iterative SLSI framework based on class selection. We show that a previous iterative SLSI algorithm is an instance of the framework. We also propose a method under our framework, which selects a class at each iteration using a simple classifier and computes the main bias vector of one class only. Our experiments demonstrate that the proposed method both improves the classification accuracy and reduces the computation cost.
Keywords
data mining; feature extraction; indexing; text analysis; class selection; feature extraction; iterative supervised latent semantic indexing; text categorization; text mining; Feature extraction; Indexing; Iterative algorithms; Laboratories; Large scale integration; Matrix decomposition; Principal component analysis; Text categorization; Text mining; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-4994-1
Type
conf
DOI
10.1109/ICIECS.2009.5366100
Filename
5366100
Link To Document