Title :
IntentFinder: A system for discovering significant information implicit in large, heterogeneous document collections and computationally mapping social networks and command nodes
Author :
Ungar, Lyle ; Leibholz, Stephen ; Chaski, Carole
Author_Institution :
Comput. & Inf. Sci, Univ. of Pennsylvania, Philadelphia, PA, USA
Abstract :
IntentFinder is a computational method of extracting mutually relevant information from a large collection of narrative data. We describe an approach that takes advantage of a new view of documents as coming from evolving stories. IntentFinder consists of six main components: 1) A document management system; 2) A story extraction system; 3) A significance determination system; 4) A reputation management; 5) A lexical-semantic analysis; 6) A user interface. In addition a method has been found for quantitatively determining the topology and hierarchy of a social subnetwork embedded inside a very noisy self-reorganizing network (e.g., the Internet). All these components will work together to allow analysts to discover and understand events and stories implicit in collections of documents, including newswire, reports, emails and tweets, which would be prohibitively difficult to uncover manually, and ultimately estimating the organizational structure of a social network.
Keywords :
document handling; information retrieval; social networking (online); user interfaces; IntentFinder; Internet; command nodes; document management system; heterogeneous document collections; lexical-semantic analysis; mutually relevant information extraction; noisy self-reorganizing network; reputation management; significance determination system; social networks; social subnetwork; story extraction system; topology; user interface; Correlation; Data mining; Network topology; Organizations; Social network services; Topology; User interfaces; Documents; Intelligence; Lexical; Messages; Reputation; Semantic; Story integration;
Conference_Titel :
Technologies for Homeland Security (HST), 2011 IEEE International Conference on
Conference_Location :
Waltham, MA
Print_ISBN :
978-1-4577-1375-0
DOI :
10.1109/THS.2011.6107874