Title :
Intelligent Searching using Sentence Context
Author :
Chickinsky, Alan
Author_Institution :
Chief Scientist, 4801 Stonecroft Blvd., Chantilly, VA 20151, (703) 633-8300 ext. 8554, Alan.Chickinsky@ngc.com
Abstract :
Fusion centers have access to terra bytes of information from both businesses and federal, state and local governments. The information ranges from computer generated databases to collections of notes with transcript of interviews performed by law enforcement personnel. Searching notes and transcripts is difficult and time consuming because humans do not use a comment set of phrases. Phrases vary due to past experiences, origin of birth and generational differences. Search engines try to compensate for these differences by performing context searches. Context searches replace specific words in the search request with other predetermined words. One can reduce false positives with an intelligent search based on grammar and English sentence structure. Intelligent sentence searching converts the each document into a set of simple sentences using only words in the predefined dictionary. These simple sentences capture the essence of the document. The conversion methodology uses synonyms, idiomatic expressions, grammar, patterns of speech and word location to create a searchable index. Because of the limited dictionary and elimination of most ambiguities, searches can be free of false positives. This paper describes the sentence context methodology, examples, and test results for a representative law enforcement report.
Keywords :
grammars; information retrieval; legislation; English sentence structure; computer generated database; grammar; intelligent sentence searching; law enforcement personnel; predefined dictionary; search engines; Databases; Dictionaries; Fusion power generation; Humans; Intelligent structures; Law enforcement; Local government; Personnel; Search engines; Speech;
Conference_Titel :
Technologies for Homeland Security, 2008 IEEE Conference on
Conference_Location :
Waltham, MA
Print_ISBN :
978-1-4244-1977-7
Electronic_ISBN :
978-1-4244-1978-4
DOI :
10.1109/THS.2008.4534428