Title :
Identification of related information of interest across free text documents
Author :
Johnson, James R. ; Miller, Alice ; Khan, Latifur ; Thuraisingham, Bhavani ; Kantarcioglu, Murat
Author_Institution :
ADB Consulting, Carson City, NV, USA
Abstract :
An approach is presented for finding information of interest in a free text document and then identifying and presenting related information of interest from other free text documents. The goal is to find specific related items of interest within documents whether the documents are of the same category or not. Information of interest is defined with respect to expanded entity phrases and their ontology mappings. Powerful techniques requiring minimal training are described for expanding an entity phrase to include attributes from components of a complex sentence; for measuring relatedness of same-name expanded entity phrases; and for detecting related expanded entity phrases through ontology inferences. A representative dataset is described and preliminary measurements of performance against ground truth are provided.
Keywords :
ontologies (artificial intelligence); text analysis; free text documents; ontology inference; ontology mapping; same-name expanded entity phrase; Markov processes; Ontologies; Semantics; Terminology; Thesauri; Vehicles; Visualization; entity extraction; information of interest; natural language processing; relatedness;
Conference_Titel :
Intelligence and Security Informatics (ISI), 2011 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-0082-8
DOI :
10.1109/ISI.2011.5984058