• DocumentCode
    3025899
  • Title

    Chinese Automatic Text Summarization Based on Keyword Extraction

  • Author

    Jiang Xiao-Yu

  • Author_Institution
    Bus. Sch., Beijing Inst. of Fashion Technol., Beijing, China
  • fYear
    2009
  • fDate
    25-26 April 2009
  • Firstpage
    225
  • Lastpage
    228
  • Abstract
    In order to over the shortcoming of the incomprehensive of summarization, a new lexical-chain-based keywords extraction and automatic summarization algorithm from Chinese texts based on the unknown word recognition using co-occurrence of neighbor words is proposed in this paper, and an algorithm for constructing lexical chains based on Hownet knowledge database is given in the method, lexical chains are firstly constructing by calculating the semantic similarity between terms, then keywords are extracted and the importance of each sentence is calculated according to the lexical chain´s intensity, the terms´ entropy and position. The experimental results show that the summarization generated by the improved algorithm gets better performance than other methods both in recall and precision.
  • Keywords
    database management systems; text analysis; word processing; Chinese automatic text summarization; Hownet knowledge database; entropy; lexical-chain-based keywords extraction; position; semantic similarity; word recognition; Algorithm design and analysis; Character recognition; Clustering algorithms; Computers; Data mining; Databases; Entropy; Frequency; Statistics; Text recognition; automatic summarization; keyword extraction; lexical chain;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Technology and Applications, 2009 First International Workshop on
  • Conference_Location
    Wuhan, Hubei
  • Print_ISBN
    978-0-7695-3604-0
  • Type

    conf

  • DOI
    10.1109/DBTA.2009.9
  • Filename
    5207775