• DocumentCode
    2861985
  • Title

    Class Selection Based Iterative Supervised Latent Semantic Indexing for Text Categorization

  • Author

    Wang, Ming-Bo ; Liu, Cheng-Lin

  • Author_Institution
    Nat. Lab. of Pattern Recognition (NLPR), Chinese Acad. of Sci., Beijing, China
  • fYear
    2009
  • fDate
    19-20 Dec. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Latent semantic indexing (LSI) is an effective technique for feature extraction in text mining, and supervised LSI (SLSI) algorithms have been proposed to exploit the class labels of training data. In this paper, we propose an iterative SLSI framework based on class selection. We show that a previous iterative SLSI algorithm is an instance of the framework. We also propose a method under our framework, which selects a class at each iteration using a simple classifier and computes the main bias vector of one class only. Our experiments demonstrate that the proposed method both improves the classification accuracy and reduces the computation cost.
  • Keywords
    data mining; feature extraction; indexing; text analysis; class selection; feature extraction; iterative supervised latent semantic indexing; text categorization; text mining; Feature extraction; Indexing; Iterative algorithms; Laboratories; Large scale integration; Matrix decomposition; Principal component analysis; Text categorization; Text mining; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4994-1
  • Type

    conf

  • DOI
    10.1109/ICIECS.2009.5366100
  • Filename
    5366100