Title :
Using Self Organizing Map to cluster Arabic crime documents
Author :
Alruily, Meshrif ; Ayesh, Aladdin ; Al-Marghilani, Abdulsamad
Author_Institution :
Software Technol. Res. Lab., De Montfort Univ., Leicester, UK
Abstract :
This paper presents a system that combines two text mining techniques; information extraction and clustering. A rule-based approach is used to perform the information extraction task, based on the dependency relation between some intransitive verbs and prepositions. This relationship helps in extracting types of crime from documents within the crime domain. With regard to the clustering task, the Self Organizing Map (SOM) is used to cluster Arabic crime documents based on crime types. This work is then validated through experiments, the results of which show that the techniques developed here are promising.
Keywords :
data mining; document handling; information retrieval; knowledge based systems; natural language processing; pattern clustering; self-organising feature maps; task analysis; Arabic crime document clustering; dependency relation; information clustering; information extraction task; intransitive prepositions; intransitive verbs; rule based approach; self organizing map; text mining; Context; Data mining; Data visualization; Grammar; Neurons; Organizing; Pragmatics;
Conference_Titel :
Computer Science and Information Technology (IMCSIT), Proceedings of the 2010 International Multiconference on
Conference_Location :
Wisla
Print_ISBN :
978-1-4244-6432-6
DOI :
10.1109/IMCSIT.2010.5679616