论文标题
水质管理可解释的可解释的预测功能
Mining Explainable Predictive Features for Water Quality Management
论文作者
论文摘要
借助水质管理过程,识别和解释特征之间的关系,例如位置和天气变量元组以及水质变量(例如细菌水平),是获得见解和确定应进行干预措施的领域的关键。需要搜索过程来确定影响水质的现象的位置和类型,并需要解释如何影响质量以及哪些因素最相关。本文解决了这两个问题。开发了一个过程,用于收集代表空间区域各种变量并用于训练模型和推理的功能的数据。使用模型和沙普利值对功能的性能进行分析。 Shapley值起源于合作游戏理论,可用于帮助解释机器学习结果。使用都柏林大运河盆地的几种机器学习算法和水质数据进行评估。
With water quality management processes, identifying and interpreting relationships between features, such as location and weather variable tuples, and water quality variables, such as levels of bacteria, is key to gaining insights and identifying areas where interventions should be made. There is a need for a search process to identify the locations and types of phenomena that are influencing water quality and a need to explain how the quality is being affected and which factors are most relevant. This paper addresses both of these issues. A process is developed for collecting data for features that represent a variety of variables over a spatial region and which are used for training models and inference. An analysis of the performance of the features is undertaken using the models and Shapley values. Shapley values originated in cooperative game theory and can be used to aid in the interpretation of machine learning results. Evaluations are performed using several machine learning algorithms and water quality data from the Dublin Grand Canal basin.