Title :
Web Mining for Open Source Intelligence
Author_Institution :
Joint Res. Centre, OSVision Ltd., London
Abstract :
Web mining for open source intelligence is the retrieval, extraction and analysis of information from on-line Internet sites. There are two separate applications areas this paper will review, namely live news-monitoring and targeted topic based data mining. Most newspapers and news agencies have Web sites with live updates on unfolding events, opinions and perspectives on world events. Most governments monitor news reports to feel the pulse of public opinion, and for early warning of emerging crises. The Joint Research Centre has developed significant experience in Internet content monitoring through its work on media monitoring (EMM) for the European Commission. EMM forms the core of the Commission´s daily press monitoring service. Intelligence services and law enforcement agencies also require specific site monitoring and topic monitoring, and EMM technology has been applied to the wider Internet for this purpose. The software extracts and downloads all the textual content from monitored sites and applies information extraction techniques. These tools help analysts process large amounts of documents to derive structured data. Lastly the visualisation of the extracted data is important for analysts to identify patterns and trends derived from both news reports and Web mining.
Keywords :
Internet; Web sites; content management; data mining; data visualisation; information retrieval; Internet content monitoring; Web mining; Web site; data mining; data visualisation; information analysis; information extraction; information retrieval; intelligence service; live news-monitoring; media monitoring; online Internet site; open source intelligence; Data mining; Data visualization; Government; Information analysis; Information retrieval; Law enforcement; Monitoring; Pattern analysis; Web and internet services; Web mining; Information Extraction; Media Monitoring; Multilinguality; Visualisation; Web Mining;
Conference_Titel :
Information Visualisation, 2008. IV '08. 12th International Conference
Conference_Location :
London
Print_ISBN :
978-0-7695-3268-4