DocumentCode
2139043
Title
Neural Network for Text Classification Based on Singular Value Decomposition
Author
Li, Cheng Hua ; Park, Soon Cheol
Author_Institution
Chonbuk Nat. Univ., Jeonbuk
fYear
2007
fDate
16-19 Oct. 2007
Firstpage
47
Lastpage
52
Abstract
This paper proposed new text classification models based on artificial neural networks and Singular Value Decomposition (SVD). The neural networks are trained by Multi-Output Perceptron Learning algorithm (MOPL) and Back-Propagation Neural Network (BPNN). Most classic classification systems represent the contents of documents with a set of index terms, it has been known as vector space model (VSM). However, this method need a high dimensional space to represent the documents, and it dose not take into account the semantic relationship between terms, which could lead to poor classification performance. In this paper, we introduce singular value decomposition to our systems. SVD was used to learn and represent relations among very large numbers of words and very large numbers of natural text passages in which they occurred. It could not only greatly reduce the dimensional but also discover the important associative relationships between terms. It also helps to accelerate the training speed and improve the classification accuracy. We test our classification systems on the standard Reuter-21578 collection. Experimental evaluations show that the systems training with SVD are much fast then the original systems with VSM, and also achieve better classification results.
Keywords
backpropagation; pattern classification; perceptrons; singular value decomposition; text analysis; artificial neural networks; backpropagation neural network; classification systems; multioutput perceptron learning; singular value decomposition; text classification; vector space model; Acceleration; Artificial neural networks; Computer networks; Data mining; Information technology; Internet; Machine learning algorithms; Neural networks; Singular value decomposition; Text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Information Technology, 2007. CIT 2007. 7th IEEE International Conference on
Conference_Location
Aizu-Wakamatsu, Fukushima
Print_ISBN
978-0-7695-2983-7
Type
conf
DOI
10.1109/CIT.2007.52
Filename
4385055
Link To Document