Title :
IEDs in the dark web: Lexicon expansion and genre classification
Author_Institution :
Dept. of Management Information Systems, Univ. of Arizona, Tucson, AZ
Abstract :
Web pages about improvised explosive devices (IEDs) are a significant source of knowledge for security organizations. In this paper, we present significant improvements to our approach for discovering and classifying IED-related web pages in the Dark Web. We describe a statistical feature ranking approach for expanding the keyword lexicon used to discover IED-related web pages, which identified new relevant terms for inclusion. We also present an improved web page feature representation designed to better capture the structural and stylistic cues that reveal genres of communication, along with a series of experiments comparing the classification performance of the new representation with that of our existing approach.
Keywords :
Internet; classification; security of data; statistical analysis; IED; Web page feature representation; dark Web; genre classification; keyword lexicon; lexicon expansion; relevant term; security organization; statistical feature ranking; Artificial intelligence; Explosives; Information analysis; Information security; Military computing; Resource management; Terrorism; Weapons; Web pages; improvised explosive device
Conference_Title :
IEEE International Conference on Intelligence and Security Informatics, 2009 (ISI '09)
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4171-6
Electronic_ISBN :
978-1-4244-4173-0
DOI :
10.1109/ISI.2009.5137293