DocumentCode :
3642090
Title :
Automatic summarization of Turkish documents using non-negative matrix factorization
Author :
Aysun Güran;Nilgün Güler Bayazit;Eren Bekar
Author_Institution :
Yildiz Technical University, Istanbul, Turkey
fYear :
2011
fDate :
6/1/2011 12:00:00 AM
Firstpage :
480
Lastpage :
484
Abstract :
Automatic document summarization is a process, where a computer summarizes a document. This paper presents the performance analysis of an automatic Turkish document summarization system that applies Non-negative matrix factorization based summarization algorithm with different preprocessing methods. The preprocessing method called “Consecutive Words Detection” is an innovative approach that uses Turkish Wikipedia links to represent related consecutive words as a single term and the result of the evaluation process is promising for document summarization in Turkish.
Keywords :
"Semantics","Matrix decomposition","Internet","Electronic publishing","Encyclopedias","Performance evaluation"
Publisher :
ieee
Conference_Titel :
Innovations in Intelligent Systems and Applications (INISTA), 2011 International Symposium on
Print_ISBN :
978-1-61284-919-5
Type :
conf
DOI :
10.1109/INISTA.2011.5946121
Filename :
5946121
Link To Document :
بازگشت