DocumentCode
3642090
Title
Automatic summarization of Turkish documents using non-negative matrix factorization
Author
Aysun Güran;Nilgün Güler Bayazit;Eren Bekar
Author_Institution
Yildiz Technical University, Istanbul, Turkey
fYear
2011
fDate
6/1/2011 12:00:00 AM
Firstpage
480
Lastpage
484
Abstract
Automatic document summarization is a process, where a computer summarizes a document. This paper presents the performance analysis of an automatic Turkish document summarization system that applies Non-negative matrix factorization based summarization algorithm with different preprocessing methods. The preprocessing method called “Consecutive Words Detection” is an innovative approach that uses Turkish Wikipedia links to represent related consecutive words as a single term and the result of the evaluation process is promising for document summarization in Turkish.
Keywords
"Semantics","Matrix decomposition","Internet","Electronic publishing","Encyclopedias","Performance evaluation"
Publisher
ieee
Conference_Titel
Innovations in Intelligent Systems and Applications (INISTA), 2011 International Symposium on
Print_ISBN
978-1-61284-919-5
Type
conf
DOI
10.1109/INISTA.2011.5946121
Filename
5946121
Link To Document