Title :
AK-Modes: A weighted clustering algorithm for finding similar case subsets
Author :
Ma, Lianhang ; Chen, Yefang ; Huang, Hao
Author_Institution :
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
Abstract :
Finding similar crime case subsets is an important task for intelligence analysts in crime investigation. It can not only provide multiple clues to solve crimes but also improve efficiency to catch the criminals. However, the conventional approach by querying specific attributes in relational databases has two defects: first, it is relatively of poor efficiency when a lot of incidents have to be handled; second, the querying process can not reflect the importance of attributes in different case categories. In this paper, we propose a two-phase clustering algorithm called AK-Modes to automatically find the similar case subsets from large datasets. In the attribute-weighing phase, we compute the weight of each attribute related to an offender´s behavior trait using the concept Information Gain Ratio (IGR) in classification domain. Then the result of attribute-weighing phase is utilized in the clustering process to find the similar case subsets. Experiments show that AK-Modes is effective and can find significant results.
Keywords :
pattern clustering; police data processing; relational databases; AK-Modes; crime investigation; information gain ratio; intelligence analysts; relational databases; similar case subset finding; two-phase clustering algorithm; weighted clustering algorithm; Accuracy; Algorithm design and analysis; Classification algorithms; Clustering algorithms; Data mining; Databases; Training data; K-Modes clustering; crime data mining; information gain ratio; weighted attributes;
Conference_Titel :
Intelligent Systems and Knowledge Engineering (ISKE), 2010 International Conference on
Conference_Location :
Hangzhou
Print_ISBN :
978-1-4244-6791-4
DOI :
10.1109/ISKE.2010.5680876