DocumentCode
468156
Title
Chinese Automatic Summarization Based on Thematic Sentence Discovery
Author
Wang, Meng ; Li, Chungui ; Wang, Xiaorong
Author_Institution
GuangXi Univ. of Technol., Liuzhou
Volume
1
fYear
2007
fDate
24-27 Aug. 2007
Firstpage
482
Lastpage
486
Abstract
In this paper, we propose a practical approach for extracting the most relevant sentences from the original document to form a summary. The idea of our approach is to obtain summary based on similarity of thematic sentences, which use terms as features rather than words, and employs term length term frequency (TLTF) to compute weight of terms to obtain features. Furthermore, it uses an improved k-means method to cluster sentences, and compute similarity of thematic sentences according to clustering results. Experimental results indicate a clear superiority of the proposed method over the traditional method under the proposed evaluation scheme.
Keywords
natural languages; pattern clustering; text analysis; Chinese automatic summarization; improved k-means method; term length term frequency; thematic sentence discovery; Artificial intelligence; Clustering algorithms; Data mining; Feature extraction; Frequency; Humans; Information retrieval; Internet; Natural language processing; Statistics;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location
Haikou
Print_ISBN
978-0-7695-2874-8
Type
conf
DOI
10.1109/FSKD.2007.214
Filename
4405972
Link To Document