TY - JOUR
T1 - Hybridizing spatial machine learning to explore the fine-scale heterogeneity between stunting prevalence and its associated risk determinants in Rwanda
AU - Nduwayezu, Gilbert
AU - Mansourian, Ali
AU - Bizimana, Jean Pierre
AU - Pilesjö, Petter
N1 - Publisher Copyright:
© 2025 Wuhan University. Published by Informa UK Limited, trading as Taylor & Francis Group.
PY - 2025
Y1 - 2025
N2 - Childhood stunting is a serious global public health issue that exhibits local spatial variations. Previous studies have used traditional statistical methods to identify stunting risk factors, and little is known about the application and usefulness of spatial machine learning techniques in identifying localized stunting risk factors based on complex datasets. This study assesses the performance of hybrid spatial machine learning techniques in predicting stunting among children below the age of five in Rwanda. We cross-sectionally examined Bayesian-modeled surface stunting prevalence data linked with their related covariates obtained from the 2019–2020 Rwanda Demographic and Health Survey. Using these datasets, we implemented geographical weighted summary statistics, global random forest, and hybrid random forest model complimented with interpretable machine learning to identify local disparities in the association between stunting prevalence and its related risk factors. The results revealed significant variation in stunting prevalence within different areas, with the Western and Northern Province regions exhibiting higher stunting prevalence compared to the other provinces in the country. Our findings demonstrate the superiority of the hybrid random forest model over the global random forest model in achieving a more accurate fit when explaining stunting prevalence. Additionally, our findings reveal a non-linear relationship between stunting prevalence risk and its predictors. Specifically, we observed the highest risk of stunting when the percentage of households without toilet facility reached 2%. However, when the proportion of antenatal visits, men’s education, women’s literacy, access to clean water, and delivery place reached 50%, 85%, 80%, 70%, and 95%, respectively, the risk of stunting prevalence was at its lowest point. Furthermore, our findings indicate a lower prevalence of stunting when less than 20% of households use insecticide-treated nets. Localized information on stunting is highly valued by stakeholders for measuring and monitoring progress toward sustainable development goals.
AB - Childhood stunting is a serious global public health issue that exhibits local spatial variations. Previous studies have used traditional statistical methods to identify stunting risk factors, and little is known about the application and usefulness of spatial machine learning techniques in identifying localized stunting risk factors based on complex datasets. This study assesses the performance of hybrid spatial machine learning techniques in predicting stunting among children below the age of five in Rwanda. We cross-sectionally examined Bayesian-modeled surface stunting prevalence data linked with their related covariates obtained from the 2019–2020 Rwanda Demographic and Health Survey. Using these datasets, we implemented geographical weighted summary statistics, global random forest, and hybrid random forest model complimented with interpretable machine learning to identify local disparities in the association between stunting prevalence and its related risk factors. The results revealed significant variation in stunting prevalence within different areas, with the Western and Northern Province regions exhibiting higher stunting prevalence compared to the other provinces in the country. Our findings demonstrate the superiority of the hybrid random forest model over the global random forest model in achieving a more accurate fit when explaining stunting prevalence. Additionally, our findings reveal a non-linear relationship between stunting prevalence risk and its predictors. Specifically, we observed the highest risk of stunting when the percentage of households without toilet facility reached 2%. However, when the proportion of antenatal visits, men’s education, women’s literacy, access to clean water, and delivery place reached 50%, 85%, 80%, 70%, and 95%, respectively, the risk of stunting prevalence was at its lowest point. Furthermore, our findings indicate a lower prevalence of stunting when less than 20% of households use insecticide-treated nets. Localized information on stunting is highly valued by stakeholders for measuring and monitoring progress toward sustainable development goals.
KW - hybrid random forest
KW - interpretable machine learning
KW - local variation
KW - nonlinear effects
KW - Stunting prevalence
U2 - 10.1080/10095020.2025.2459133
DO - 10.1080/10095020.2025.2459133
M3 - Article
AN - SCOPUS:86000522796
SN - 1009-5020
JO - Geo-Spatial Information Science
JF - Geo-Spatial Information Science
ER -