RT Article
T1 Beyond Traditional Risk Scores: Tackling LS/CMI Offender Misclassifications with Machine Learning
JF Journal of quantitative criminology
VO 41
IS 4
SP 549
OP 574
A1 Arbour, William
A1 Brouillette-Alarie, Sébastien
A1 Giguère, Guy
A1 Lacroix, Guy
A1 Marchand, Steeve
A2 Brouillette-Alarie, Sébastien
A2 Giguère, Guy
A2 Lacroix, Guy
A2 Marchand, Steeve
LA English
YR 2025
UL https://krimdok.uni-tuebingen.de/Record/1942594151
AB ObjectivesThis paper investigates the accuracy of offender risk assessment scoring methods. We study the degree of misclassification resulting from the conventional practice of aggregating individual items to derive risk scores and categories. We document which types of offenders are prone to misclassification, particularly in relation to age and gender.MethodsWe use a machine learning algorithm to leverage the rich set of information available in the LS/CMI. Using all 45,535 assessments conducted between 2008 and 2015 in Quebec (Canada), we estimate probabilities from a random forest algorithm to predict individual risks of recidivism over a two-year follow-up. We compare the resulting probabilities to those inferred from the risk scores or categories to document the extent of misclassification. We devise a simple algorithm to construct alternative risk categories that reduce misclassification relative to the LS/CMI total scores and categories.Results The probabilities obtained from the random forest approach accurately predict individual probabilities to reoffend. Compared with these predictions, the traditional aggregation of items into risk scores or categories yields substantial misclassification for certain groups of offenders. In particular, we find that the risk associated with older individuals when using the LS/CMI risk categories is overestimated by about 10 percentage points. Our alternative risk categories, devised from our machine learning predictions, successfully avoid such misclassification.Conclusions Traditional methods of aggregating items from risk assessments into scores may lead to substantial misclassification, especially for older offenders. Misclassification arises from 1) items not being equally risk-relevant; 2) information collected by the LS/CMI being excluded or overly simplified when constructing scores; and 3) age being omitted from risk scores. Machine learning algorithms avoid these pitfalls and can be used to construct less biased categories.
K1 LS/CMI
K1 machine learning
K1 Recidivism
K1 Risk Assessment
DO 10.1007/s10940-025-09606-w