DocumentCode :
2962317
Title :
Automatic document summarizer
Author :
Harris, A. ; Oussalah, M.
Author_Institution :
Dept. of Electron., Electr. & Comput. Eng., Univ. of Birmingham, Birmingham
fYear :
2008
fDate :
9-10 Sept. 2008
Firstpage :
1
Lastpage :
6
Abstract :
The need for automatic summarization becomes crucial with the exponential increase of data available on www and digital libraries, which makes the search for relevant pieces of information a difficult task. Therefore, needless to say, an automatic summarizer would save a lot to user and operators. However, the development of such systems is also shown to be very challenging due to inherent difficulty in dealing with natural language processing, the subjectivity of the evaluation process and the limitation of the mathematical models. This paper puts forward a proposal for an automatic summarizer system, which explicitly makes use of the semantic relatedness of document sentences using WordNet taxonomy.On the other hand, several other attributes are taken into account in the design stage, which include similarity to user´s query, if any, which allows us to integrate some personalization to the outcome, similarity to the document title, location and frequency of co-occurrence. The developed summarizer also enables some search abilities, where three distinct search platforms were integrated (Lucene, GTP and Xapian). The obtained results were compared to MEAD summarizer.
Keywords :
abstracting; document handling; natural language processing; WordNet taxonomy; automatic document summarizer system; automatic summarization; document sentences; document title; mathematical models; natural language processing; search platforms; user query; Buildings; Data engineering; Frequency; Information processing; Internet; Mathematical model; Natural language processing; Proposals; Software libraries; World Wide Web; information processing; semantic relatedness; summarization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cybernetic Intelligent Systems, 2008. CIS 2008. 7th IEEE International Conference on
Conference_Location :
London
Print_ISBN :
978-1-4244-2914-1
Electronic_ISBN :
978-1-4244-2915-8
Type :
conf
DOI :
10.1109/UKRICIS.2008.4798921
Filename :
4798921
Link To Document :
بازگشت