• DocumentCode
    3317348
  • Title

    Automatic subject indexing of Chinese documents

  • Author

    Zhang, Sulan ; He, Qing ; Zheng, Zheng ; Shi, Zhongzhi

  • Author_Institution
    Inst. of Comput. Technol., Chinese Acad. of Sci., Beijing, China
  • fYear
    2005
  • fDate
    30 Oct.-1 Nov. 2005
  • Firstpage
    256
  • Lastpage
    261
  • Abstract
    Automatic subject indexing is a process to produce automatically a set of attributes that represent the content or topic of a document. In this paper, two approaches of automatic subject indexing based on VSM (vector space model) and subject words segmentation respectively are presented. The experimental results show that the first approach based on VSM is appropriate when the documents, which are indexed, are concentrative and the subject words available are less. The second approach based on subject words segmentation improves greatly efficiency of indexing and inter-indexer consistency.
  • Keywords
    indexing; word processing; automatic subject indexing; document indexing; inter-indexer consistency; subject word segmentation; vector space model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9361-9
  • Type

    conf

  • DOI
    10.1109/NLPKE.2005.1598744
  • Filename
    1598744