• DocumentCode
    2506975
  • Title

    A hybrid approach to automatic text summarization

  • Author

    Chang, Te-Min ; Hsiao, Wen-Feng

  • Author_Institution
    Dept. of Inf. Manage., Nat. Sun Yat-sen Univ., Kaohsiung
  • fYear
    2008
  • fDate
    8-11 July 2008
  • Firstpage
    65
  • Lastpage
    70
  • Abstract
    Automatic text summarization is to compress an original document into an abridged version by extracting almost all of the essential concepts with text mining techniques. This research focuses on developing a hybrid automatic text summarization approach, KCS, to enhancing the quality of summaries. KCS employs the K-mixture probabilistic model to establish term weights in a statistical sense, and further identifies the term relationships to derive the connective strength (CS) of nouns. Sentences are ranked and extracted based on their CS values. We conduct two experiments to justify the proposed approach. The quality of extracted summary is examined by its capability of increasing text classification accuracy. The results show that our proposed approach, KCS, performs best among all approaches considered. It implies that KCS can extract more representative sentences from the document and its feasibility in text summarization applications is thus justified.
  • Keywords
    data mining; natural language processing; probability; text analysis; K-mixture probabilistic model; automatic text summarization; connective strength; text mining; Business; Data mining; Information management; Information retrieval; Pattern analysis; Tagging; Text analysis; Text categorization; Text mining; Text processing; automatic text summarization; linguistic approach; statistical approach;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Technology, 2008. CIT 2008. 8th IEEE International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-1-4244-2357-6
  • Electronic_ISBN
    978-1-4244-2358-3
  • Type

    conf

  • DOI
    10.1109/CIT.2008.4594651
  • Filename
    4594651