Title :
Research on Automatic Summarization System Based on Topic Partition
Author_Institution :
Math., Phys. & Inf. Eng. Coll., Zhejiang Normal Univ., Jinhua, China
Abstract :
Automatic summarization is an important issue in natural language processing and becomes more and more important in some domains such as web information retrieval. This paper describes an automatic summarization system based on topic partition: Firstly, it analyzes the text structure of the web pages, then proposes an algorithm for multi-topic text partitioning based on semantic distance between paragraphs, and finally get the summarization by extracting the key sentences from different topics. This method can makes the abstract of multi-topic article have more general content and more balanced structure. The experimental result shows that its value is stable.
Keywords :
Internet; abstracting; information retrieval; natural language processing; text analysis; Web information retrieval; Web pages; abstraction; automatic summarization system; key sentence extraction; multitopic article; multitopic text partitioning; natural language processing; text structure analysis; topic partition; Artificial intelligence; Educational institutions; HTML; Information retrieval; Information systems; Internet; Mathematics; Pattern matching; Physics; Web pages; automatic summarization; information retrieval; text structure; topic partition;
Conference_Titel :
Web Information Systems and Mining, 2009. WISM 2009. International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3817-4
DOI :
10.1109/WISM.2009.24