Title :
A hybrid approach to automatic text summarization
Author :
Chang, Te-Min ; Hsiao, Wen-Feng
Author_Institution :
Dept. of Inf. Manage., Nat. Sun Yat-sen Univ., Kaohsiung
Abstract :
Automatic text summarization is to compress an original document into an abridged version by extracting almost all of the essential concepts with text mining techniques. This research focuses on developing a hybrid automatic text summarization approach, KCS, to enhancing the quality of summaries. KCS employs the K-mixture probabilistic model to establish term weights in a statistical sense, and further identifies the term relationships to derive the connective strength (CS) of nouns. Sentences are ranked and extracted based on their CS values. We conduct two experiments to justify the proposed approach. The quality of extracted summary is examined by its capability of increasing text classification accuracy. The results show that our proposed approach, KCS, performs best among all approaches considered. It implies that KCS can extract more representative sentences from the document and its feasibility in text summarization applications is thus justified.
Keywords :
data mining; natural language processing; probability; text analysis; K-mixture probabilistic model; automatic text summarization; connective strength; text mining; Business; Data mining; Information management; Information retrieval; Pattern analysis; Tagging; Text analysis; Text categorization; Text mining; Text processing; automatic text summarization; linguistic approach; statistical approach;
Conference_Titel :
Computer and Information Technology, 2008. CIT 2008. 8th IEEE International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
978-1-4244-2357-6
Electronic_ISBN :
978-1-4244-2358-3
DOI :
10.1109/CIT.2008.4594651