Title :
Improving Watchlist Screening By Combining Evidence From Multiple Search Algorithms
Author :
Miller, Keith J. ; Arehart, Mark D.
Author_Institution :
MITRE Corp., McLean, VA
Abstract :
In this paper, we describe a metasearch tool resulting from experiments in aggregating the results of different name matching algorithms on a knowledge-intensive multicultural name matching task. Three retrieval engines that match Romanized names were tested on a noisy and predominantly Arabic dataset. One is based on a generic string matching algorithm; another is designed specifically for Arabic names; and the third makes use of culturally specific matching strategies for multiple cultures. We show that even a relatively naive method for aggregating results significantly increases effectiveness over each of the individual algorithms: it nearly triples the F-score of the worst-performing algorithm included in the aggregate and yields a 6-point improvement in F-score over the single best-performing algorithm included.
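The abstract does not say which aggregation method the authors used, so the following is only an illustrative sketch of one possible naive approach: keep a candidate record if at least a minimum number of engines return it, then score the merged set with a balanced F-score. The function names, the `min_votes` threshold, the record identifiers, and the toy result lists are all assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter


def aggregate_by_votes(result_sets, min_votes=2):
    """Keep candidates returned by at least `min_votes` engines.

    Hypothetical voting rule; the paper's actual method is not
    specified in the abstract.
    """
    votes = Counter()
    for results in result_sets:
        votes.update(set(results))  # each engine votes once per candidate
    return {cand for cand, n in votes.items() if n >= min_votes}


def f_score(returned, relevant):
    """Balanced F-score: harmonic mean of precision and recall."""
    returned, relevant = set(returned), set(relevant)
    true_pos = len(returned & relevant)
    if true_pos == 0:
        return 0.0
    precision = true_pos / len(returned)
    recall = true_pos / len(relevant)
    return 2 * precision * recall / (precision + recall)


# Made-up result lists for one query, one per engine (not real data).
generic = ["r1", "r2", "r5"]
arabic = ["r1", "r3"]
multicultural = ["r1", "r2", "r3", "r4"]
relevant = {"r1", "r2", "r3"}  # made-up ground truth

merged = aggregate_by_votes([generic, arabic, multicultural], min_votes=2)

for name, results in [("generic", generic), ("arabic", arabic),
                      ("multicultural", multicultural), ("merged", merged)]:
    print(f"{name}: F = {f_score(results, relevant):.3f}")
```

In this toy case the voted set scores higher than any single engine because each engine's false positives are not confirmed by the others. That only illustrates the general idea and says nothing about the figures reported in the paper.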
Keywords :
national security; search engines; string matching; Arabic names; Romanized names; generic string matching algorithm; metasearch tool; multiple search algorithms; name matching; watchlist screening improvement; Aggregates; Algorithm design and analysis; Cultural differences; Databases; Engines; Government; Information retrieval; Metasearch; Testing; Writing
Conference_Title :
Technologies for Homeland Security, 2008 IEEE Conference on
Conference_Location :
Waltham, MA
Print_ISBN :
978-1-4244-1977-7
Electronic_ISBN :
978-1-4244-1978-4
DOI :
10.1109/THS.2008.4534432