• DocumentCode
    2912911
  • Title

    Automated Trainable Summarizer For Financial Documents

  • Author

    Sureka, R. ; Kong, H.P.H.

  • Author_Institution
    Nanyang Technological University
  • fYear
    2006
  • fDate
    16-20 Oct. 2006
  • Firstpage
    55
  • Lastpage
    55
  • Abstract
    The overload of information available on the Internet has made text mining and simplified news and articles browsing an increasingly important user concern and a prioritized research issue. The aim of our project is to build a light-weight and effective text mining and summarization engine for the financial domain. This engine should also be easily trainable and adaptable to other domains. This paper describes a robust trainable user-focused summarizer for the financial domain that is adapted from algorithms by Kupiec, Pedersen and Chen (KPC) [4] and Lee, Goh and Kong [5]. It employs an adapted feature set to improve the robustness of the algorithm and to incorporate domain specificity in the engine. Evaluation tests verified the improved performance of our user-focused, domain-oriented, corpus-based approach over domain-independent approaches.
  • Keywords
    Abstracts; Costs; Data mining; Frequency; Humans; Internet; Robustness; Search engines; Testing; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Enterprise Distributed Object Computing Conference Workshops, 2006. EDOCW '06. 10th IEEE International
  • Conference_Location
    Hong Kong, China
  • Print_ISBN
    0-7695-2743-4
  • Type

    conf

  • DOI
    10.1109/EDOCW.2006.24
  • Filename
    4031314