论文标题
数据驱动的方法,用于预测地面臭氧浓度
A data-driven approach to the forecasting of ground-level ozone concentration
论文作者
论文摘要
预测城市地区空气污染物集中的能力对于希望通过积极措施(例如临时交通封闭)来减少污染对公共卫生的影响的决策者至关重要。在这项研究中,我们提出了一种机器学习方法,该方法适用于瑞士南部几个地理位置的臭氧浓度日期最大值的预测。由于测量站的密度较低和用例地形的复杂地形,我们采用了特征选择方法,而不是明确限制相关特征到预测站点的邻里,这在时空预测方法中很常见。然后,我们使用莎普利值来评估学习模型的解释性,以特征重要性和与臭氧预测有关的特征相互作用;我们的分析表明,受过训练的模型有效地学习了大气变量之间的解释性交叉依赖性。最后,我们展示了加权观测如何有助于提高臭氧每日峰值特定范围的预测的准确性。
The ability to forecast the concentration of air pollutants in an urban region is crucial for decision-makers wishing to reduce the impact of pollution on public health through active measures (e.g. temporary traffic closures). In this study, we present a machine learning approach applied to the forecast of the day-ahead maximum value of the ozone concentration for several geographical locations in southern Switzerland. Due to the low density of measurement stations and to the complex orography of the use case terrain, we adopted feature selection methods instead of explicitly restricting relevant features to a neighbourhood of the prediction sites, as common in spatio-temporal forecasting methods. We then used Shapley values to assess the explainability of the learned models in terms of feature importance and feature interactions in relation to ozone predictions; our analysis suggests that the trained models effectively learned explanatory cross-dependencies among atmospheric variables. Finally, we show how weighting observations helps in increasing the accuracy of the forecasts for specific ranges of ozone's daily peak values.