Title :
Improving Every Child´s Chance in Life
Author :
Kolyshkina, Inna ; Brownlow, Marcus ; Taylor, James
Author_Institution :
Sch. of Inf. Technol. & Math. Sci., Univ. of South Australia, Adelaide, SA, Australia
Abstract :
This article describes the results of a data mining project designed to explore the key drivers of the Australian Early Development Index (AEDI), a numerical indicator of early childhood development vulnerability. The work was conducted during GovHack 2013, a 48-hour Australian Open Data competition where participants were required to use published open data sets provided by various Australian government and other agencies. We applied advanced machine learning techniques (random forests, generalised boosted regression models and multivariate adaptive regression splines) to the South Australian state and national data to gain insights into the key drivers of AEDI and to quantify the levers that the state government, community and individuals could apply to improve the situation. We found that after accounting for the population specifics and socioeconomic conditions, for example unemployment level and Index of Relative Socioeconomic Disadvantage, the most important factors impacting early childhood development were lack of motor vehicle in the household, inability to afford buying medication and maternal smoking during pregnancy. We quantified the impact of each of these factors and suggested relevant potential Government interventions. We then visualised our findings and created a Web app that allowed various intervention strategies to be interactively explored, based on the derived relationship between early child development index and its key drivers.
Keywords :
data mining; government data processing; learning (artificial intelligence); AEDI; Australian early development index; Australian government; Australian open data competition; South Australian state; buying medication; childhood development vulnerability; data mining project; every child chance improvement; generalised boosted regression models; government interventions; machine learning techniques; maternal smoking; multivariate adaptive regression splines; numerical indicator; pregnancy; socioeconomic conditions; Australia; Data visualization; Government; Indexes; Mathematical model; Pediatrics; Vehicles; AEDI; GovHack; R; Tableau; early childhood development; generalised boosted models; multivariate adaptive regression splines; open data; random forests; social health atlas;
Conference_Titel :
Data Mining Workshops (ICDMW), 2013 IEEE 13th International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4799-3143-9
DOI :
10.1109/ICDMW.2013.61