Regional Information Center for Science and Technology

DocumentCode :

3156251

Title :

Lessons from a Jihadi Corpus

Author :

Skillicorn, D.B.

Author_Institution :

Sch. of Comput., Queen's Univ., Kingston, ON, Canada

fYear :

2012

fDate :

26-29 Aug. 2012

Firstpage :

874

Lastpage :

878

Abstract :

We analyze the posts in the Islamic Awareness forum, using models for frequent words (content), for Salafist-Jihadist language, and for deception. These last two models each produce a single-factor ranking enabling, in each case, the most useful subset of posts to be selected for further analysis. Posts that rank highly for Salafist-Jihadist language rank low for deception, suggesting that faking extremist websites is probably an ineffective strategy. The process described here is a template for analysis of many kinds of open-source corpora where language models of what makes posts interesting are known.

Keywords :

Web sites; natural language processing; public domain software; word processing; Islamic Awareness forum post analysis; Jihadi open-source corpus; Salafist-Jihadist language; content frequencies; extremist Web sites; language deception; single-factor ranking; template analysis; word frequencies; Analytical models; Computational modeling; Conferences; Educational institutions; Matrix decomposition; Periodic structures; Singular value decomposition

fLanguage :

English

Publisher :

IEEE

Conference_Title :

Advances in Social Networks Analysis and Mining (ASONAM), 2012 IEEE/ACM International Conference on

Conference_Location :

Istanbul

Print_ISBN :

978-1-4673-2497-7

Type :

conf

DOI :

10.1109/ASONAM.2012.239

Filename :

6425649

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3156251