• DocumentCode
    2139043
  • Title

    Neural Network for Text Classification Based on Singular Value Decomposition

  • Author

    Li, Cheng Hua ; Park, Soon Cheol

  • Author_Institution
    Chonbuk Nat. Univ., Jeonbuk
  • fYear
    2007
  • fDate
    16-19 Oct. 2007
  • Firstpage
    47
  • Lastpage
    52
  • Abstract
    This paper proposed new text classification models based on artificial neural networks and Singular Value Decomposition (SVD). The neural networks are trained by Multi-Output Perceptron Learning algorithm (MOPL) and Back-Propagation Neural Network (BPNN). Most classic classification systems represent the contents of documents with a set of index terms, it has been known as vector space model (VSM). However, this method need a high dimensional space to represent the documents, and it dose not take into account the semantic relationship between terms, which could lead to poor classification performance. In this paper, we introduce singular value decomposition to our systems. SVD was used to learn and represent relations among very large numbers of words and very large numbers of natural text passages in which they occurred. It could not only greatly reduce the dimensional but also discover the important associative relationships between terms. It also helps to accelerate the training speed and improve the classification accuracy. We test our classification systems on the standard Reuter-21578 collection. Experimental evaluations show that the systems training with SVD are much fast then the original systems with VSM, and also achieve better classification results.
  • Keywords
    backpropagation; pattern classification; perceptrons; singular value decomposition; text analysis; artificial neural networks; backpropagation neural network; classification systems; multioutput perceptron learning; singular value decomposition; text classification; vector space model; Acceleration; Artificial neural networks; Computer networks; Data mining; Information technology; Internet; Machine learning algorithms; Neural networks; Singular value decomposition; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology, 2007. CIT 2007. 7th IEEE International Conference on
  • Conference_Location
    Aizu-Wakamatsu, Fukushima
  • Print_ISBN
    978-0-7695-2983-7
  • Type

    conf

  • DOI
    10.1109/CIT.2007.52
  • Filename
    4385055