Title :
Web content mining for alias identification: A first step towards suspect tracking
Author :
Anwar, Tarique ; Abulaish, Muhammad ; Alghathbar, Khaled
Author_Institution :
Center of Excellence in Inf. Assurance, King Saud Univ., Riyadh, Saudi Arabia
Abstract :
In this paper, we present the design of a web content mining system to identify and extract aliases of a given entity from the Web in an automatic way. Starting with a pattern-based information extraction process, the system applies n-gram technique to extract candidate aliases. Thereafter, various statistical measures are applied to identify feasible aliases from them. The extracted aliases can be used to generate profiles of suspects and keep track of their movements on the Web using different identities.
Keywords :
Internet; data mining; Web content mining; alias identification; n-gram technique; pattern based information extraction process; statistical measurement; suspect tracking; Irrigation; Monitoring; Support vector machines; World Wide Web; Alias identification; Cyber security; Suspect profiling; Web content mining; Web monitoring;
Conference_Titel :
Intelligence and Security Informatics (ISI), 2011 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-0082-8
DOI :
10.1109/ISI.2011.5984000