• DocumentCode
    3642090
  • Title

    Automatic summarization of Turkish documents using non-negative matrix factorization

  • Author

    Aysun Güran;Nilgün Güler Bayazit;Eren Bekar

  • Author_Institution
    Yildiz Technical University, Istanbul, Turkey
  • fYear
    2011
  • fDate
    6/1/2011 12:00:00 AM
  • Firstpage
    480
  • Lastpage
    484
  • Abstract
    Automatic document summarization is the process by which a computer summarizes a document. This paper presents a performance analysis of an automatic Turkish document summarization system that applies a non-negative matrix factorization (NMF) based summarization algorithm with different preprocessing methods. One preprocessing method, called “Consecutive Words Detection”, is an innovative approach that uses Turkish Wikipedia links to represent related consecutive words as a single term. The evaluation results are promising for document summarization in Turkish.
  • Keywords
    "Semantics","Matrix decomposition","Internet","Electronic publishing","Encyclopedias","Performance evaluation"
  • Publisher
    IEEE
  • Conference_Titel
    Innovations in Intelligent Systems and Applications (INISTA), 2011 International Symposium on
  • Print_ISBN
    978-1-61284-919-5
  • Type

    conf

  • DOI
    10.1109/INISTA.2011.5946121
  • Filename
    5946121
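
The abstract describes "Consecutive Words Detection" only as representing related consecutive words as a single term, using Turkish Wikipedia links. The sketch below is one possible reading of that idea, not the authors' implementation. The function name `merge_consecutive_words`, the `max_len` parameter, the greedy longest-match strategy, and the small `PHRASES` set (a stand-in for phrases harvested from Wikipedia link titles) are all assumptions made for illustration.

```python
def merge_consecutive_words(tokens, phrases, max_len=3):
    """Greedily join runs of tokens that form a known phrase into one term.

    Assumption: `phrases` holds lowercase, space-separated multi-word
    phrases, e.g. collected from Wikipedia link titles.
    """
    merged = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate window first, down to two words.
        for n in range(min(max_len, len(tokens) - i), 1, -1):
            # Note: str.lower() is not Turkish-aware for the letters I and İ.
            candidate = " ".join(tokens[i:i + n]).lower()
            if candidate in phrases:
                merged.append(candidate.replace(" ", "_"))
                i += n
                break
        else:
            # No multi-word phrase starts here; keep the single token.
            merged.append(tokens[i].lower())
            i += 1
    return merged


# Hypothetical phrase list standing in for Turkish Wikipedia link titles.
PHRASES = {"yapay zeka", "yıldız teknik üniversitesi"}

tokens = "Yıldız Teknik Üniversitesi yapay zeka üzerine çalışıyor".split()
terms = merge_consecutive_words(tokens, PHRASES)
print(terms)
```

Merging phrases before building a term-sentence matrix means a multi-word concept is counted as one feature rather than several unrelated ones, which is presumably why such a step would precede the factorization.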