DocumentCode
2194547
Title
Automatic Summarization for Chinese Text Based on Combined Words Recognition and Paragraph Clustering
Author
Jiang Chang-jin ; Peng Hong ; Ma Qian-li ; Chen Jian-chao
Author_Institution
Sch. of Comput. Sci. & Eng., South China Univ. of Technol., Guangzhou, China
fYear
2010
fDate
2-4 April 2010
Firstpage
591
Lastpage
594
Abstract
With the tremendous amount of information available electronically, there is an increasing requirement for automatic text summarization systems. An extractive summarization method is represented. The weight of a Chinese word/phrase is computed based on its frequency, part of speech, position and length. The weight of a Chinese sentence is computed by its content, position, length and cue words in it. The adjacent paragraphs are clustered into same cluster or different clusters according to their similarity. The experiment results show that the proposed algorithm has a significantly better performance compared with the traditional automated summarization algorithms based on TF-ISF method.
Keywords
pattern clustering; text analysis; Chinese phrase computation; Chinese sentence computation; Chinese word computation; TF-ISF method; automatic Chinese text summarization systems; combined words recognition; extractive summarization method; paragraph clustering; Artificial intelligence; Clustering algorithms; Data mining; Dictionaries; Frequency; Information retrieval; Information technology; Natural languages; Speech; Text recognition; Chinese combined word; automatic summarization; paragraph clustering; weight computation;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Information Technology and Security Informatics (IITSI), 2010 Third International Symposium on
Conference_Location
Jinggangshan
Print_ISBN
978-1-4244-6730-3
Electronic_ISBN
978-1-4244-6743-3
Type
conf
DOI
10.1109/IITSI.2010.15
Filename
5453695
Link To Document