Title :
Chinese Automatic Summarization Based on Thematic Sentence Discovery
Author :
Wang, Meng ; Li, Chungui ; Wang, Xiaorong
Author_Institution :
GuangXi Univ. of Technol., Liuzhou
Abstract :
In this paper, we propose a practical approach for extracting the most relevant sentences from the original document to form a summary. The idea of our approach is to obtain summary based on similarity of thematic sentences, which use terms as features rather than words, and employs term length term frequency (TLTF) to compute weight of terms to obtain features. Furthermore, it uses an improved k-means method to cluster sentences, and compute similarity of thematic sentences according to clustering results. Experimental results indicate a clear superiority of the proposed method over the traditional method under the proposed evaluation scheme.
Keywords :
natural languages; pattern clustering; text analysis; Chinese automatic summarization; improved k-means method; term length term frequency; thematic sentence discovery; Artificial intelligence; Clustering algorithms; Data mining; Feature extraction; Frequency; Humans; Information retrieval; Internet; Natural language processing; Statistics;
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
DOI :
10.1109/FSKD.2007.214