Title :
Information extraction by an abstractive text summarization for an Indian regional language
Author :
Kallimani, Jagadish S. ; Srinivasa, K.G. ; Eswara Reddy, B.
Author_Institution :
Dept. of CSE, MSRIT, Bangalore, India
Abstract :
The Internet provides many sources of different opinions, expressed through user reviews of products, blogs, and forum discussions. Systems which could automatically summarize these opinions would be immensely useful for those who wish to use this information to make decisions. The previous work in automatic summarization has completely focused on extractive summarization, in which key sentences are identified from the source text and extracted to form the output. An alternative solution is abstractive summarization in which the information from the source text is first extracted into the form of abstract data which is then post processed to infer the most important message from the original text. This work is built upon past work of extractive summarization methods to create abstractive summaries by creating new sentences in it. This paper conveys the methodology for the abstractive summarization process and its evaluation considering Telugu, a south Indian regional language, as the language of study.
Keywords :
Internet; Web sites; information retrieval; natural language processing; text analysis; Internet; Telugu; abstract data; abstractive text summarization; blog; extractive summarization; forum discussion; information extraction; key sentence identification; opinion source; opinion summarization; product user review; source text; south Indian regional language; Conferences; Data mining; Natural language processing; Speech; Tagging; Automatic text summarization; Extractive and abstractive summarization; Information retrieval; Stemming; Word count frequency;
Conference_Titel :
Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
Conference_Location :
Tokushima
Print_ISBN :
978-1-61284-729-0
DOI :
10.1109/NLPKE.2011.6138217