Title :
Automatic summarization of Turkish documents using non-negative matrix factorization
Author :
Aysun Güran;Nilgün Güler Bayazit;Eren Bekar
Author_Institution :
Yildiz Technical University, Istanbul, Turkey
fDate :
6/1/2011 12:00:00 AM
Abstract :
Automatic document summarization is a process, where a computer summarizes a document. This paper presents the performance analysis of an automatic Turkish document summarization system that applies Non-negative matrix factorization based summarization algorithm with different preprocessing methods. The preprocessing method called “Consecutive Words Detection” is an innovative approach that uses Turkish Wikipedia links to represent related consecutive words as a single term and the result of the evaluation process is promising for document summarization in Turkish.
Keywords :
"Semantics","Matrix decomposition","Internet","Electronic publishing","Encyclopedias","Performance evaluation"
Conference_Titel :
Innovations in Intelligent Systems and Applications (INISTA), 2011 International Symposium on
Print_ISBN :
978-1-61284-919-5
DOI :
10.1109/INISTA.2011.5946121