• DocumentCode
    675652
  • Title

    Sequential pattern based multi document summarization — An exploratory approach

  • Author

    Alias, Suraya ; Muhammad, Siti Khaotijah

  • Author_Institution
    Sch. of Eng. & Inf. Technol., Univ. Malaysia Sabah, Kota Kinabalu, Malaysia
  • fYear
    2013
  • fDate
    27-28 Nov. 2013
  • Firstpage
    85
  • Lastpage
    90
  • Abstract
    Sequential Pattern Mining which aims to discover all frequent sequences of itemsets (patterns) from a large data collection has been applied in the Text Mining domain such as Text Categorization and Pattern Identification. However, in the area of Document Summarization the effort is still considered as green and exploratory. In the real world, a sentence is more than just a collection of un-ordered sequence of words, where each sentence carries their own meaning. By discovering these textual patterns is essential since the patterns can describe the text, by preserving the sequential order of the words in the document. Thus, the motivation here is to investigate the feasibility to develop a Sequential Pattern-based Summarizer model near future in order to reduce redundancy information from multiple text resources; at same time preserving the meaning of the original text document using the Semantic similarity approach. This paper reviewed some of the existing techniques in the area of multiple document summarizations to better understand the gap and issues underlying this area. By incorporating the semantic knowledge of sentences in the multiple documents is hoped to assist and alleviate the long-winding process for non-subject expert researches in trying to find the similarities and correlation between text resources.
  • Keywords
    data mining; pattern recognition; text analysis; data collection; multidocument summarization; pattern identification; redundancy information reduction; semantic similarity approach; sentence semantic knowledge; sequential pattern mining; sequential pattern-based summarizer model; text categorization; text mining; Databases; Information retrieval; Information systems; Semantics; Technological innovation; Text mining; MultiDocument; Sequential Pattern Mining; Summarization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Research and Innovation in Information Systems (ICRIIS), 2013 International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4799-2486-8
  • Type

    conf

  • DOI
    10.1109/ICRIIS.2013.6716690
  • Filename
    6716690