• DocumentCode
    243497
  • Title

    Latent Factor SVM for Text Categorization

  • Author

    Xiaofei Zhou ; Li Guo ; Ping Liu ; Yanbing Liu

  • Author_Institution
    Inst. of Inf. Eng., Beijing, China
  • fYear
    2014
  • fDate
    14-14 Dec. 2014
  • Firstpage
    105
  • Lastpage
    110
  • Abstract
    Text categorization is an important research in nature language process and content analysis. In this paper, we present latent factor SVM (LF-SVM) for text categorization which use latent factor vectors for category representation on text categorization. We prove that latent factors extracted by PLSA (probability latent semantic analysis) can span convex structure to express text category. Based on the category expression we adopt maximal margin hyper plane to divide the categories. The experiments on normal text datasets show that our motivation and algorithm are reasonable and effective.
  • Keywords
    natural language interfaces; statistical analysis; support vector machines; text analysis; LF-SVM; category representation; content analysis; convex structure; latent factor SVM; latent factor vectors; maximal margin hyper plane; nature language process; probability latent semantic analysis; text categorization; text datasets; Feature extraction; Semantics; Support vector machine classification; Text categorization; Training; Vectors; PLSA; SVM; Text categorization; latent semantic; text classification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • Print_ISBN
    978-1-4799-4275-6
  • Type

    conf

  • DOI
    10.1109/ICDMW.2014.9
  • Filename
    7022586