DocumentCode
2912911
Title
Automated Trainable Summarizer For Financial Documents
Author
Sureka, R. ; Kong, H.P.H.
Author_Institution
Nanyang Technological University
fYear
2006
fDate
16-20 Oct. 2006
Firstpage
55
Lastpage
55
Abstract
The overload of information available on the Internet has made text mining and simplified news and articles browsing an increasingly important user concern and a prioritized research issue. The aim of our project is to build a light-weight and effective text mining and summarization engine for the financial domain. This engine should also be easily trainable and adaptable to other domains. This paper describes a robust trainable user-focused summarizer for the financial domain that is adapted from algorithms by Kupiec, Pedersen and Chen (KPC) [4] and Lee, Goh and Kong [5]. It employs an adapted feature set to improve the robustness of the algorithm and to incorporate domain specificity in the engine. Evaluation tests verified the improved performance of our user-focused, domain-oriented, corpus-based approach over domain-independent approaches.
Keywords
Abstracts; Costs; Data mining; Frequency; Humans; Internet; Robustness; Search engines; Testing; Text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Enterprise Distributed Object Computing Conference Workshops, 2006. EDOCW '06. 10th IEEE International
Conference_Location
Hong Kong, China
Print_ISBN
0-7695-2743-4
Type
conf
DOI
10.1109/EDOCW.2006.24
Filename
4031314
Link To Document