DocumentCode :
1882115
Title :
Global Pattern Search at Scale
Author :
Crouser, R. Jordan ; Schmidt, Matthew C. ; Kelley, Stephen ; Miller, Benjamin ; Hook, Daniel ; Edwards, Lauren ; Milosavljevic, Maja ; Michel, Elizabeth ; Ferme, Elizabeth ; Carrington, Robert ; Reuther, Albert I.
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
fYear :
2015
fDate :
14-16 April 2015
Firstpage :
1
Lastpage :
6
Abstract :
In recent years, data collection has far outpaced the tools for data analysis in the area of non-traditional GEOINT analysis. Traditional tools are designed to analyze small-scale numerical data, but there are few good interactive tools for processing large amounts of unstructured data such as raw text. In addition to the complexities of data processing, presenting the data in a way that is meaningful to the end user poses another challenge. In our work, we focused on analyzing a corpus of 35,000 news articles and creating an interactive geovisualization tool to reveal patterns to human analysts. Our comprehensive tool, Global Pattern Search at Scale (GPSS), addresses three major problems in data analysis: free text analysis, high volumes of data, and interactive visualization. GPSS uses an Accumulo database for high-volume data storage, and a matrix of word counts and event detection algorithms to process the free text. For visualization, the tool displays an interactive web application to the user, featuring a map overlaid with document clusters and events, search and filtering options, a timeline, and a word cloud. In addition, the GPSS tool can be easily adapted to process and understand other large free-text datasets.
Keywords :
Internet; data analysis; data visualisation; geophysics computing; text analysis; Accumulo database; GPSS; document clusters; event detection algorithms; free text analysis; global pattern search at scale; high-volume data storage; interactive Web application; interactive geovisualization tool; interactive tools; interactive visualization; nontraditional GEOINT analysis; raw text; small-scale numerical data analysis; unstructured data processing; word counts; Algorithm design and analysis; Context; Data visualization; Event detection; Geospatial analysis; Market research; Tag clouds;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Technologies for Homeland Security (HST), 2015 IEEE International Symposium on
Conference_Location :
Waltham, MA
Print_ISBN :
978-1-4799-1736-5
Type :
conf
DOI :
10.1109/THS.2015.7225293
Filename :
7225293
Link To Document :
بازگشت