DocumentCode :
2755973
Title :
Web content mining for alias identification: A first step towards suspect tracking
Author :
Anwar, Tarique ; Abulaish, Muhammad ; Alghathbar, Khaled
Author_Institution :
Center of Excellence in Inf. Assurance, King Saud Univ., Riyadh, Saudi Arabia
fYear :
2011
fDate :
10-12 July 2011
Firstpage :
195
Lastpage :
197
Abstract :
In this paper, we present the design of a web content mining system to identify and extract aliases of a given entity from the Web in an automatic way. Starting with a pattern-based information extraction process, the system applies n-gram technique to extract candidate aliases. Thereafter, various statistical measures are applied to identify feasible aliases from them. The extracted aliases can be used to generate profiles of suspects and keep track of their movements on the Web using different identities.
Keywords :
Internet; data mining; Web content mining; alias identification; n-gram technique; pattern based information extraction process; statistical measurement; suspect tracking; Irrigation; Monitoring; Support vector machines; World Wide Web; Alias identification; Cyber security; Suspect profiling; Web content mining; Web monitoring;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligence and Security Informatics (ISI), 2011 IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-0082-8
Type :
conf
DOI :
10.1109/ISI.2011.5984000
Filename :
5984000
Link To Document :
بازگشت