Title :
Non-negative Sparse Semantic Coding for text categorization
Author :
Wenbin Zheng ; Yuntao Qian
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
Abstract :
In text categorization, the dimensionality reduction methods, such as latent semantic indexing and nonnegative matrix factorization, commonly yield the dense representation that is not consistent with our common knowledge. On the other hand, the popular sparse coding methods are time-consuming and their dictionaries might contain negative entries, which is difficulty to interpret the semantic meaning of text. This paper proposes a novel Non-negative Sparse Semantic Coding (NSSC) approach for text reprentation. NSSC provides an efficient algorithm to construct a set of non-negative basis vectors that span a low dimensional semantic subspace, where each document obtains a non-negative sparse representation corresponding to these basis vectors. Extensive experimental results have shown that the proposed approach achieves a good performance and presents more interpretability with respect to these semantic concepts.
Keywords :
dictionaries; indexing; matrix decomposition; pattern classification; text analysis; NSSC approach; dense representation; dimensionality reduction methods; latent semantic indexing; low dimensional semantic subspace; negative entries; nonnegative basis vectors; nonnegative matrix factorization; nonnegative sparse representation; nonnegative sparse semantic coding; nonnegative sparse semantic coding approach; popular sparse coding methods; semantic concepts; semantic text meaning; text categorization; text reprentation; Dictionaries; Encoding; Large scale integration; Semantics; Text categorization; Training; Vectors;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4