DocumentCode :
262862
Title :
Obtaining knowledge from the Web using fusion and summarization techniques
Author :
Escudero, Sandra ; Garrido, Angel L. ; Ilarri, Sergio
Author_Institution :
IIS Dept., Univ. of Zaragoza, Zaragoza, Spain
fYear :
2014
fDate :
7-10 July 2014
Firstpage :
1
Lastpage :
8
Abstract :
Nowadays, information and knowledge are fundamental in our society. This has led to an information overload problem on the Internet. For this reason, we propose an automatic system that retrieves, selects, and extracts information from the Web using a methodology based on fusion techniques. The system, called Diana, facilitates and improves the identification of interesting content, and it extracts the most relevant information about a given topic from the Web as a summary. To do this, we have developed algorithms that use semantic tools, Natural Language Processing (NLP) techniques, statistics, a generic gazetteer, and fusion methods. The development of the system is ongoing, but the preliminary results obtained so far are very promising and show the interest of our proposal.
Keywords :
Internet; information retrieval; natural language processing; sensor fusion; statistical analysis; Diana system; Internet; NLP techniques; Web; fusion methods; fusion technique; generic gazetteer; information extraction; information retrieval; information selection; natural language processing; semantic tools; statistics; summarization technique; Data integration; Feature extraction; HTML; Search engines; Semantics; Web pages; NLP; Summaries; fusion; text mining;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Information Fusion (FUSION), 2014 17th International Conference on
Conference_Location :
Salamanca
Type :
conf
Filename :
6916038