DocumentCode :
594739
Title :
Non-negative Sparse Semantic Coding for text categorization
Author :
Wenbin Zheng ; Yuntao Qian
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
fYear :
2012
fDate :
11-15 Nov. 2012
Firstpage :
409
Lastpage :
412
Abstract :
In text categorization, the dimensionality reduction methods, such as latent semantic indexing and nonnegative matrix factorization, commonly yield the dense representation that is not consistent with our common knowledge. On the other hand, the popular sparse coding methods are time-consuming and their dictionaries might contain negative entries, which is difficulty to interpret the semantic meaning of text. This paper proposes a novel Non-negative Sparse Semantic Coding (NSSC) approach for text reprentation. NSSC provides an efficient algorithm to construct a set of non-negative basis vectors that span a low dimensional semantic subspace, where each document obtains a non-negative sparse representation corresponding to these basis vectors. Extensive experimental results have shown that the proposed approach achieves a good performance and presents more interpretability with respect to these semantic concepts.
Keywords :
dictionaries; indexing; matrix decomposition; pattern classification; text analysis; NSSC approach; dense representation; dimensionality reduction methods; latent semantic indexing; low dimensional semantic subspace; negative entries; nonnegative basis vectors; nonnegative matrix factorization; nonnegative sparse representation; nonnegative sparse semantic coding; nonnegative sparse semantic coding approach; popular sparse coding methods; semantic concepts; semantic text meaning; text categorization; text reprentation; Dictionaries; Encoding; Large scale integration; Semantics; Text categorization; Training; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
ISSN :
1051-4651
Print_ISBN :
978-1-4673-2216-4
Type :
conf
Filename :
6460158
Link To Document :
بازگشت