DocumentCode :
189169
Title :
Using Political Party Affiliation Data to Measure Civil Servants´ Risk of Corruption
Author :
Carvalho, Rommel ; Carvalho, Rommel ; Ladeira, Marcelo ; Mendes Monteiro, Fernando ; Mendes, Gilson
Author_Institution :
Dept. of Strategic Inf., Brazilian Office of the Comptroller Gen., Brasilia, Brazil
fYear :
2014
fDate :
18-22 Oct. 2014
Firstpage :
166
Lastpage :
171
Abstract :
This paper presents a case study of machine learning applied to measure the risk of corruption of civil servants using political party affiliation data. Initially, a statistical hypothesis test verified the dependency between corruption and political party affiliation. Then, we constructed datasets with standardization and three different discrimination techniques. Using Weka environment, this work shows the application and statistical evaluation of four classification algorithms to build models for predicting risk of corruption: Bayesian Networks, Support Vector Machines, Random Forest, and Artificial Neural Networks with back propagation. To evaluate the models we used data mining metrics such as precision, recall, kappa statistic and percent correct. Lastly, the case study compares the learned model with the best performance to the experts´ model. The comparison not only confirms previous experts´ affirmations, but also provides new assertions on the affiliation-corruptibility relation.
Keywords :
backpropagation; belief networks; data mining; neural nets; pattern classification; politics; standardisation; statistical testing; support vector machines; Bayesian network classification algorithm; Weka environment; affiliation-corruptibility relation; artificial neural network classification algorithm; backpropagation; civil servant corruption risk measurement; corruption risk prediction; data mining metrics; discretization techniques; expert affirmations; expert model; kappa statistic; learned model; machine learning; percent correct; political party affiliation data; precision; random forest classification algorithm; recall; standardization; statistical evaluation; statistical hypothesis test; support vector machine classification algorithm; Artificial neural networks; Bayes methods; Data mining; Measurement; Prediction algorithms; Radio frequency; Support vector machines; Civil Servant; Corruption; Machine Learning; Political Affiliation; Random Forest;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Systems (BRACIS), 2014 Brazilian Conference on
Conference_Location :
Sao Paulo
Type :
conf
DOI :
10.1109/BRACIS.2014.39
Filename :
6984825
Link To Document :
بازگشت