• DocumentCode
    3043263
  • Title

    An Efficient Vietnamese Text Summarization Approach Based on Graph Model

  • Author

    Hoang, Tu Anh Nguyen ; Nguyen, Hoang Khai ; Tran, Quang Vinh

  • Author_Institution
    Fac. of Inf. Technol., Univ. of Sci., Ho Chi Minh City, Vietnam
  • fYear
    2010
  • fDate
    1-4 Nov. 2010
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper proposes an automatic method to generate an extractive summary of multiple Vietnamese documents which are related to a common topic by modeling text documents as weighted undirected graphs. It initially builds undirected graphs with vertices representing the sentences of documents and edges indicate the similarity between sentences. Then, by adopting PageRank algorithm, we can generate salient scores for sentences. Sentences are ranked according to their salient scores and selected based on Maximal marginal relevance to form the summaries. These summaries are combined and applied the same process one more time to form the final extractive summary of the document set. A series of experiments are performed on Vietnamese news articles. The results demonstrate the effectiveness of the proposed technique over reference systems.
  • Keywords
    graph theory; text analysis; PageRank algorithm; Vietnamese documents; Vietnamese text summarization approach; graph model; maximal marginal relevance; weighted undirected graphs; Computational modeling; Data mining; Lead; Machine learning; Meteorology; Pragmatics; Redundancy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computing and Communication Technologies, Research, Innovation, and Vision for the Future (RIVF), 2010 IEEE RIVF International Conference on
  • Conference_Location
    Hanoi
  • Print_ISBN
    978-1-4244-8074-6
  • Type

    conf

  • DOI
    10.1109/RIVF.2010.5633162
  • Filename
    5633162