DocumentCode :
3025899
Title :
Chinese Automatic Text Summarization Based on Keyword Extraction
Author :
Jiang Xiao-Yu
Author_Institution :
Bus. Sch., Beijing Inst. of Fashion Technol., Beijing, China
fYear :
2009
fDate :
25-26 April 2009
Firstpage :
225
Lastpage :
228
Abstract :
In order to over the shortcoming of the incomprehensive of summarization, a new lexical-chain-based keywords extraction and automatic summarization algorithm from Chinese texts based on the unknown word recognition using co-occurrence of neighbor words is proposed in this paper, and an algorithm for constructing lexical chains based on Hownet knowledge database is given in the method, lexical chains are firstly constructing by calculating the semantic similarity between terms, then keywords are extracted and the importance of each sentence is calculated according to the lexical chain´s intensity, the terms´ entropy and position. The experimental results show that the summarization generated by the improved algorithm gets better performance than other methods both in recall and precision.
Keywords :
database management systems; text analysis; word processing; Chinese automatic text summarization; Hownet knowledge database; entropy; lexical-chain-based keywords extraction; position; semantic similarity; word recognition; Algorithm design and analysis; Character recognition; Clustering algorithms; Computers; Data mining; Databases; Entropy; Frequency; Statistics; Text recognition; automatic summarization; keyword extraction; lexical chain;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Database Technology and Applications, 2009 First International Workshop on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3604-0
Type :
conf
DOI :
10.1109/DBTA.2009.9
Filename :
5207775
Link To Document :
بازگشت