• DocumentCode
    2807755
  • Title

    Document Representation Using Nonnegative Matrix Factorization

  • Author

    Pei, XiaoBing ; Xiao, Laiyuan ; Chen, Changqing

  • Author_Institution
    Coll. of Software, HuaZhong Universirty of Sci. & Technol., Wuhan, China
  • fYear
    2009
  • fDate
    11-13 Dec. 2009
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Non-negative matrix factorization (NMF) is an emerging technique of latent semantic analysis from the given document corpus. The existing NMF algorithms don not use the intrinsic structure information of original document corpus. In order to preserve intrinsic structure information in latent semantic space extracted by NMF, a NMF algorithm with intrinsic structure information properties is presented. The primary ideal is to extend the original NMF through incorporating the intrinsic structure information constraints inside the NMF decomposition. Our experimental results performed on the RCV1 and SECTOR data sets show that the proposed method is superior to NMF for document latent semantic analysis.
  • Keywords
    document handling; information retrieval; matrix decomposition; SECTOR data sets; document representation; intrinsic structure information constraints; latent semantic analysis; nonnegative matrix factorization; vector space information retrieval; Data mining; Educational institutions; Indexing; Information retrieval; Large scale integration; Matrix decomposition; Performance analysis; Space technology; Sparse matrices; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence and Software Engineering, 2009. CiSE 2009. International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-4507-3
  • Electronic_ISBN
    978-1-4244-4507-3
  • Type

    conf

  • DOI
    10.1109/CISE.2009.5362815
  • Filename
    5362815