DocumentCode :
468156
Title :
Chinese Automatic Summarization Based on Thematic Sentence Discovery
Author :
Wang, Meng ; Li, Chungui ; Wang, Xiaorong
Author_Institution :
GuangXi Univ. of Technol., Liuzhou
Volume :
1
fYear :
2007
fDate :
24-27 Aug. 2007
Firstpage :
482
Lastpage :
486
Abstract :
In this paper, we propose a practical approach for extracting the most relevant sentences from the original document to form a summary. The idea of our approach is to obtain summary based on similarity of thematic sentences, which use terms as features rather than words, and employs term length term frequency (TLTF) to compute weight of terms to obtain features. Furthermore, it uses an improved k-means method to cluster sentences, and compute similarity of thematic sentences according to clustering results. Experimental results indicate a clear superiority of the proposed method over the traditional method under the proposed evaluation scheme.
Keywords :
natural languages; pattern clustering; text analysis; Chinese automatic summarization; improved k-means method; term length term frequency; thematic sentence discovery; Artificial intelligence; Clustering algorithms; Data mining; Feature extraction; Frequency; Humans; Information retrieval; Internet; Natural language processing; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
Type :
conf
DOI :
10.1109/FSKD.2007.214
Filename :
4405972
Link To Document :
بازگشت