Title :
A data mining approach for preventing undergraduate students retention
Author :
Silva, Hadautho Roberto Barros da ; Adeodato, Paulo Jorge Leitão
Author_Institution :
Center for Inf., UFPE - Fed. Univ. of Pernambuco, Recife, Brazil
Abstract :
The Brazilian government has invested funds via the REUNI program to increase the amount of places in public universities. It has also made another important attempt to increase the number of places by approving regulations to legalize the release of the places occupied by retained students for new students. According to these regulations´ retention criteria, there were around 50% of places retained by these students. This Article presents a data mining approach for assessing the risk of undergraduate student retention at the end of the second semester of course aiming at supporting the counseling of students to prevent their retention. The approach has been developed for the Federal University of Pernambuco, focusing on 6 of its major courses which involved data transformation from a database of over 400,000 subjects records, the application of logistic regression for risk assessment, and induction of rules for explaining that risk at the counseling process. The risk estimation solution reached Max_KS=0.51 and AUC_ROC=0.84 in performance. Three scenarios of 30%, 50% and 70% of efficacy in the counseling process show that the procedure should be implemented and is economically viable with decision threshold and return of investment varying according to that efficacy.
Keywords :
data mining; educational courses; educational institutions; further education; government data processing; investment; regression analysis; Brazilian government; Federal University of Pernambuco; REUNI program; counseling process; data mining approach; data transformation; decision threshold; logistic regression; public universities; regulation retention criteria; risk assessment; risk estimation solution; second semester; students counseling; undergraduate students retention prevention; Data mining; Databases; Educational institutions; Employee welfare; Logistics; Training; a priori; data mining; logistic regression; student evasion; student retention;
Conference_Titel :
Neural Networks (IJCNN), The 2012 International Joint Conference on
Conference_Location :
Brisbane, QLD
Print_ISBN :
978-1-4673-1488-6
Electronic_ISBN :
2161-4393
DOI :
10.1109/IJCNN.2012.6252437