• DocumentCode
    468314
  • Title

    Learning Locality Discriminating Indexing for Text Categorization

  • Author

    Hu, Jiani ; Deng, Weihong ; Guo, Jun ; Xu, Weiran

  • Author_Institution
    Beijing Univ. of Posts & Telecommun., Beijing
  • Volume
    3
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    239
  • Lastpage
    242
  • Abstract
    This paper introduces a locality discriminating indexing (LDI) algorithm for text categorization. The LDI algorithm offers a manifold way of discriminant analysis. Based on the hypothesis that samples from different classes reside in class-specific manifold structures, the algorithm depicts the manifold structures by a nearest-native graph and a invader graphs. And a new locality discriminant criterion is proposed, which best preserves the within-class local structures while suppresses the between-class overlap. Using the notion of the Laplacian of the graphs, the LDI algorithm finds the optimal linear transformation by solving the generalized eigenvalue problem. The feasibility of the LDI algorithm has been successfully tested in text categorization using 20NG and Reuters-21578 databases. Experiment results show LDI is an effective technique for document modeling and representations for classification.
  • Keywords
    eigenvalues and eigenfunctions; graphs; indexing; text analysis; discriminant analysis; document modeling; generalized eigenvalue problem; invader graph; learning; locality discriminant criterion; locality discriminating indexing algorithm; nearest-native graph; optimal linear transformation; text categorization; Algorithm design and analysis; Clustering algorithms; Eigenvalues and eigenfunctions; Indexing; Laplace equations; Large scale integration; Linear discriminant analysis; Scattering; Testing; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.383
  • Filename
    4406236