DocumentCode :
3401301
Title :
Generic multi-document summarization using cluster refinement and NMF
Author :
Park, Sun ; An, Dong Un ; Cho, Youn Jeong
Author_Institution :
Adv. Grad. Educ. Center of Jeonbuk for Electron. & Inf. Technol.-BK21, Chonbuk Nat. Univ., Jeonju, South Korea
fYear :
2009
fDate :
14-17 Dec. 2009
Firstpage :
65
Lastpage :
70
Abstract :
In this paper, a generic summarization method that uses cluster refinement and NMF is introduced to extract meaningful sentences from documents. The proposed method uses cluster refinement to improve the quality of document clustering since it helps us to remove dissimilarity information easily and avoid biased inherent semantics of documents to be reflected in clusters by NMF. In addition, it uses the weighted semantic variable to select meaningful sentences because the extracted sentences are well covered with the major topics of document. The experimental results demonstrate that the proposed method has better performance than other methods that use the other methods.
Keywords :
feature extraction; pattern clustering; text analysis; cluster refinement; document clustering; generic multidocument summarization; meaningful sentence extraction; Clustering algorithms; Data mining; Diversity methods; Diversity reception; Educational technology; Feature extraction; Internet; Sun; Vectors; NMF; cluster coherence; cluster refinement; geneirc multi-document summarization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology (ISSPIT), 2009 IEEE International Symposium on
Conference_Location :
Ajman
Print_ISBN :
978-1-4244-5949-0
Type :
conf
DOI :
10.1109/ISSPIT.2009.5407492
Filename :
5407492
Link To Document :
بازگشت