Title :
A neuro-SVM model for text classification using latent semantic indexing
Author :
Mitra, Vikramjit ; Wang, Chia-Jiu ; Banerjee, Satarupa
Author_Institution :
Dept. of Electr. Eng., Worcester Polytech. Inst., MA, USA
fDate :
31 July-4 Aug. 2005
Abstract :
This paper presents a new model integrating a recurrent neural network (RNN) and a least squares support vector machine (LS-SVM) for classification of document titles according to different predetermined categories. The new model proposed in this paper is abbreviated as neuro-SVM. Based on the neuro-SVM model, a system is implemented, using latent semantic indexing (LSI) to generate probabilistic coefficients from document titles, which are used as the input to the system. The system´s performance is demonstrated with a corpus of 96956 words, from University of Denver´s Penrose library catalogue and the accuracy rate of the proposed system is found to be 99.66%.
Keywords :
least squares approximations; pattern classification; recurrent neural nets; support vector machines; text analysis; LS-SVM; RNN; document title classification; latent semantic indexing; least squares support vector machine; neuro-SVM model; recurrent neural network; text classification; Indexing; Information retrieval; Large scale integration; Libraries; Material storage; Recurrent neural networks; Sparse matrices; Support vector machine classification; Support vector machines; Text categorization;
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1555893