• DocumentCode
    2864804
  • Title

    Fuzzy Genetic Semantic Based Text Summarization

  • Author

    Suanmali, Ladda ; Salim, Naomie ; Binwahlan, Mohammed Salem

  • Author_Institution
    Fac. of Sci. & Technol., Suan Dusit Rajabhat Univ., Bangkok, Thailand
  • fYear
    2011
  • fDate
    12-14 Dec. 2011
  • Firstpage
    1184
  • Lastpage
    1191
  • Abstract
    Automatic text summarization is a data reduction process to exclude unnecessary details and present important information in a shorter version. One way to summarize document is by extracting important sentences in the document. To select suitable sentences, a numerical rank is assigned to each sentence based on a sentence scoring approach. Highly ranked sentences are used for the summary. This paper proposed an automatic text summarization approach based on sentence extraction using fuzzy logic, genetic algorithm, semantic role labeling and their combinations to generate high quality summaries. This study explored the benefits of the genetic algorithm in the optimization problem in for feature selection during the training phase and adjusts feature weights during the testing phase. Fuzzy IF-THEN rules were used to balance the weights between important and unimportant features. Conventional extraction methods cannot capture semantic relations between concepts in a text. Therefore, this research investigates the use of the semantic role labeling to capture the semantic contents in sentences and incorporate it into the summarization method. This paper is evaluated in terms of performance using ROUGE toolkit. Experimental results showed that the summaries produced by the proposed approaches are better than other approaches produced by Microsoft Word 2007, Copernic Summarizer, and MANYASPECTS summarizers.
  • Keywords
    data reduction; fuzzy logic; fuzzy set theory; genetic algorithms; performance evaluation; semantic networks; text analysis; Copernic Summarizer; MANYASPECTS summarizers; Microsoft Word 2007; ROUGE toolkit; automatic text summarization approach; conventional extraction methods; data reduction process; document summarization; feature selection; feature weights; fuzzy IF-THEN rules; fuzzy genetic semantic based text summarization; fuzzy logic; genetic algorithm; high quality summary; highly ranked sentences; numerical rank; optimization problem; performance evaluation; semantic contents; semantic relations; semantic role labeling; sentence extraction; sentence scoring approach; summarization method; testing phase; training phase; Biological cells; Feature extraction; Fuzzy logic; Genetic algorithms; Labeling; Semantics; Vectors; Fuzzy logic; Genetic algorithm; Semantic role labeling; Sentence extraction; Statistical method; Text summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Dependable, Autonomic and Secure Computing (DASC), 2011 IEEE Ninth International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-1-4673-0006-3
  • Type

    conf

  • DOI
    10.1109/DASC.2011.192
  • Filename
    6118856