• DocumentCode
    468156
  • Title

    Chinese Automatic Summarization Based on Thematic Sentence Discovery

  • Author

    Wang, Meng ; Li, Chungui ; Wang, Xiaorong

  • Author_Institution
    GuangXi Univ. of Technol., Liuzhou
  • Volume
    1
  • fYear
    2007
  • fDate
    24-27 Aug. 2007
  • Firstpage
    482
  • Lastpage
    486
  • Abstract
    In this paper, we propose a practical approach for extracting the most relevant sentences from the original document to form a summary. The idea of our approach is to obtain summary based on similarity of thematic sentences, which use terms as features rather than words, and employs term length term frequency (TLTF) to compute weight of terms to obtain features. Furthermore, it uses an improved k-means method to cluster sentences, and compute similarity of thematic sentences according to clustering results. Experimental results indicate a clear superiority of the proposed method over the traditional method under the proposed evaluation scheme.
  • Keywords
    natural languages; pattern clustering; text analysis; Chinese automatic summarization; improved k-means method; term length term frequency; thematic sentence discovery; Artificial intelligence; Clustering algorithms; Data mining; Feature extraction; Frequency; Humans; Information retrieval; Internet; Natural language processing; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
  • Conference_Location
    Haikou
  • Print_ISBN
    978-0-7695-2874-8
  • Type

    conf

  • DOI
    10.1109/FSKD.2007.214
  • Filename
    4405972