Title :
Processing online news streams for large-scale semantic analysis
Author :
Miloš Krstajić;Florian Mansmann;Andreas Stoffel;Martin Atkinson;Daniel A. Keim
Author_Institution :
University of Konstanz, Germany
fDate :
3/1/2010 12:00:00 AM
Abstract :
While Internet has enabled us to access a vast amount of online news articles originating from thousands of different sources, the human capability to read all these articles has stayed rather constant. Usually, the publishing industry takes over the role of filtering this enormous amount of information and presenting it in an appropriate way to the group of their subscribers. In this paper, the semantic analysis of such news streams is discussed by introducing a system that streams online news collected by the Europe Media Monitor to our proposed semantic news analysis system. Thereby, we describe in detail the emerging challenges and the corresponding engineering solutions to process incoming articles close to real-time. To demonstrate the use of our system, the case studies show a) temporal analysis of entities, such as institutions or persons, and b) their co-occurence in news articles.
Keywords :
"Large-scale systems","Humans","Internet","Information filtering","Information filters","Streaming media","Monitoring","Law","Legal factors","Data security"
Conference_Titel :
Data Engineering Workshops (ICDEW), 2010 IEEE 26th International Conference on
Print_ISBN :
978-1-4244-6522-4
DOI :
10.1109/ICDEW.2010.5452710