TY  - JOUR
PY  - 2019//
TI  - Beyond early warning indicators: high school dropout and machine learning
JO  - Oxford Bulletin of Economics and Statistics
A1  - Sansone, Dario
SP  - 456
EP  - 485
VL  - 81
IS  - 2
N2  - This paper combines machine learning with economic theory in order to analyse high school dropout. It provides an algorithm to predict which students are going to drop out of high school by relying only on information from 9th grade. This analysis emphasizes that using a parsimonious early warning system - as implemented in many schools - leads to poor results. It shows that schools can obtain more precise predictions by exploiting the available high-dimensional data jointly with machine learning tools such as Support Vector Machine, Boosted Regression and Post-LASSO. Goodness-of-fit criteria are selected based on the context and the underlying theoretical framework: model parameters are calibrated by taking into account the policy goal - minimizing the expected dropout rate - and the school budget constraint. Finally, this study verifies the existence of heterogeneity through unsupervised machine learning by dividing students at risk of dropping out into different clusters.
LA  - en
SN  - 0305-9049
UR  - http://dx.doi.org/10.1111/obes.12277
ID  - ref1
ER  - 