TY - GEN
T1 - Evaluating the Indoor Radon Concentrations in the Swedish Building Stock Using Statistical and Machine Learning
AU - Wu, Pei-Yu
AU - Johansson, Tim
AU - Mangold, Mikael
AU - Sandels, Claes
AU - Mjörnell, Kristina
PY - 2023
Y1 - 2023
N2 - Exposure to excessive indoor radon causes around 500 lung cancer deaths in Sweden annually. However, until 2020, indoor radon measurements were only conducted in around 16% of Swedish single-family houses and 17% of multifamily houses. It is estimated that approximately 16% of single-family houses exceed the indoor radon reference level of 200 Bq/m3, and the corresponding situation in multifamily houses is unknown. Measuring indoor radon on an urban scale is complicated and costly. Statistical and machine learning, exploiting historical data for pattern identification, provides alternative approaches for assessing indoor radon risk in existing dwellings. By training MARS (Multivariate Adaptive Regression Splines) and Random Forest (RF) regression models with the data labels from the radon measurement records in the Swedish Energy Performance Certification registers, property registers, soil maps, and the radiometric grids, the correlations between response and predictive variables can be untangled. The interplay of the key features, including uranium and thorium concentrations, ventilation systems, construction year, basements, and the number of floors, and their impact magnitudes on indoor radon concentrations, are investigated in the study. The regression models tailored for different building classes were developed and evaluated. Despite the data complexity, the RF models can explain 28% of the variance in multifamily houses, 24% in all buildings, and 21% in single-family houses. To improve model fitting, more intricate supervised learning algorithms should be explored in the future. The study outcomes can contribute to prioritizing remediation measures for building stocks suspected of high indoor radon risk.
AB - Exposure to excessive indoor radon causes around 500 lung cancer deaths in Sweden annually. However, until 2020, indoor radon measurements were only conducted in around 16% of Swedish single-family houses and 17% of multifamily houses. It is estimated that approximately 16% of single-family houses exceed the indoor radon reference level of 200 Bq/m3, and the corresponding situation in multifamily houses is unknown. Measuring indoor radon on an urban scale is complicated and costly. Statistical and machine learning, exploiting historical data for pattern identification, provides alternative approaches for assessing indoor radon risk in existing dwellings. By training MARS (Multivariate Adaptive Regression Splines) and Random Forest (RF) regression models with the data labels from the radon measurement records in the Swedish Energy Performance Certification registers, property registers, soil maps, and the radiometric grids, the correlations between response and predictive variables can be untangled. The interplay of the key features, including uranium and thorium concentrations, ventilation systems, construction year, basements, and the number of floors, and their impact magnitudes on indoor radon concentrations, are investigated in the study. The regression models tailored for different building classes were developed and evaluated. Despite the data complexity, the RF models can explain 28% of the variance in multifamily houses, 24% in all buildings, and 21% in single-family houses. To improve model fitting, more intricate supervised learning algorithms should be explored in the future. The study outcomes can contribute to prioritizing remediation measures for building stocks suspected of high indoor radon risk.
U2 - 10.1088/1742-6596/2654/1/012086
DO - 10.1088/1742-6596/2654/1/012086
M3 - Paper in conference proceeding
T3 - Journal of Physics: Conference Series
BT - 13th Nordic Symposium on Building Physics (NSB-2023) 12/06/2023 - 14/06/2023 Aalborg, Denmark
T2 - 13th Nordic Symposium on Building Physics
Y2 - 12 June 2023 through 14 June 2023
ER -