Title :
Discovering topics from dark websites
Author :
Yang, Li ; Liu, Feiqiong ; Kizza, Joseph M. ; Ege, Raimund K.
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. of Tennessee at Chattanooga, Chattanooga, TN
fDate :
March 30 2009-April 2 2009
Abstract :
Analysis of dark Websites is important for developing effective combating strategies against terrorism or extremists when more and more scattered terrorist cells use the ubiquity of the Internet to form communities in virtual space with fairly low costs. Terrorists or extremists anonymously set up various Web sites embedded in the public Internet, exchanging ideology, spreading propaganda, and recruiting new members. In this paper, we propose a framework to discover latent topics via analyzing contents of dark Websites. The content and data from dark Websites are gathered and extracted by crawlers and exported to documents. Latent Dirichlet allocation (LDA) algorithm is used to analyze the extracted documents so as to discover latent topics from web sites of terrorists or extremists. In contrast to the traditional information retrieval (IR) schemes, LDA-based analysis assigns a probability to a document and captures exchangeability of both words and documents. Our work helps to gain insights into the structure and communities of terrorists and extremists.
Keywords :
Internet; security of data; Internet; dark Websites; information retrieval schemes; latent Dirichlet allocation; Algorithm design and analysis; Costs; Crawlers; Data mining; Information retrieval; Internet; Linear discriminant analysis; Recruitment; Scattering; Terrorism;
Conference_Titel :
Computational Intelligence in Cyber Security, 2009. CICS '09. IEEE Symposium on
Conference_Location :
Nashville, TN
Print_ISBN :
978-1-4244-2769-7
DOI :
10.1109/CICYBS.2009.4925106