TY - JOUR
T1 - Human-induced arsenic pollution modeling in surface waters
T2 - An integrated approach using machine learning algorithms and environmental factors
AU - Mohammadi, Maziar
AU - Naghibi, Seyed Amir
AU - Motevalli, Alireza
AU - Hashemi, Hossein
PY - 2022
Y1 - 2022
N2 - In recent years, assessment of sediment contamination by heavy metals, i.e., arsenic, has attracted the interest of scientists worldwide. The present study provides a new methodology to better understand the factors influencing surface water vulnerability to arsenic pollution by two advanced machine learning algorithms including boosted regression trees (BRT) and random forest (RF). Based on the sediment quality guidelines (Effects range low) polluted and non-polluted arsenic sediment samples were defined with concentrations >8 ppm and <8 ppm, respectively. Different conditioning factors such as topographical, lithology, erosion, hydrological, and anthro- pogenic factors were acquired to model surface waters’ vulnerability to arsenic. We trained and validated the models using 70 and 30% of both polluted and non-polluted samples, respectively, and generated surface vulnerability maps. To verify the maps to arsenic pollution, the receiver operating characteristics (ROC) curve was implemented. The results approved the acceptable performance of the RF and BRT algorithms with an area under ROC values of 85% and 75.6%, respectively. Further, the findings showed higher importance of precipi- tation, slope aspect, distance from residential areas, and slope length in arsenic pollution in the modeling pro- cess. Erosion, lithology, and land use maps were introduced as the least important factors. The introduced methodology can be used to define the most vulnerable areas to arsenic pollution in advance and implement proper remediation actions to reduce the damages.
AB - In recent years, assessment of sediment contamination by heavy metals, i.e., arsenic, has attracted the interest of scientists worldwide. The present study provides a new methodology to better understand the factors influencing surface water vulnerability to arsenic pollution by two advanced machine learning algorithms including boosted regression trees (BRT) and random forest (RF). Based on the sediment quality guidelines (Effects range low) polluted and non-polluted arsenic sediment samples were defined with concentrations >8 ppm and <8 ppm, respectively. Different conditioning factors such as topographical, lithology, erosion, hydrological, and anthro- pogenic factors were acquired to model surface waters’ vulnerability to arsenic. We trained and validated the models using 70 and 30% of both polluted and non-polluted samples, respectively, and generated surface vulnerability maps. To verify the maps to arsenic pollution, the receiver operating characteristics (ROC) curve was implemented. The results approved the acceptable performance of the RF and BRT algorithms with an area under ROC values of 85% and 75.6%, respectively. Further, the findings showed higher importance of precipi- tation, slope aspect, distance from residential areas, and slope length in arsenic pollution in the modeling pro- cess. Erosion, lithology, and land use maps were introduced as the least important factors. The introduced methodology can be used to define the most vulnerable areas to arsenic pollution in advance and implement proper remediation actions to reduce the damages.
UR - https://www.scopus.com/pages/publications/85121592217
U2 - 10.1016/j.jenvman.2021.114347
DO - 10.1016/j.jenvman.2021.114347
M3 - Article
C2 - 34954681
SN - 0301-4797
VL - 305
JO - Journal of Environmental Management
JF - Journal of Environmental Management
ER -