DocumentCode
2506975
Title
A hybrid approach to automatic text summarization
Author
Chang, Te-Min ; Hsiao, Wen-Feng
Author_Institution
Dept. of Inf. Manage., Nat. Sun Yat-sen Univ., Kaohsiung
fYear
2008
fDate
8-11 July 2008
Firstpage
65
Lastpage
70
Abstract
Automatic text summarization is to compress an original document into an abridged version by extracting almost all of the essential concepts with text mining techniques. This research focuses on developing a hybrid automatic text summarization approach, KCS, to enhancing the quality of summaries. KCS employs the K-mixture probabilistic model to establish term weights in a statistical sense, and further identifies the term relationships to derive the connective strength (CS) of nouns. Sentences are ranked and extracted based on their CS values. We conduct two experiments to justify the proposed approach. The quality of extracted summary is examined by its capability of increasing text classification accuracy. The results show that our proposed approach, KCS, performs best among all approaches considered. It implies that KCS can extract more representative sentences from the document and its feasibility in text summarization applications is thus justified.
Keywords
data mining; natural language processing; probability; text analysis; K-mixture probabilistic model; automatic text summarization; connective strength; text mining; Business; Data mining; Information management; Information retrieval; Pattern analysis; Tagging; Text analysis; Text categorization; Text mining; Text processing; automatic text summarization; linguistic approach; statistical approach;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Information Technology, 2008. CIT 2008. 8th IEEE International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
978-1-4244-2357-6
Electronic_ISBN
978-1-4244-2358-3
Type
conf
DOI
10.1109/CIT.2008.4594651
Filename
4594651
Link To Document