论文标题
基于机器学习的基于双相氙时间投影室中脉冲分类的方法
A machine learning-based methodology for pulse classification in dual-phase xenon time projection chambers
论文作者
论文摘要
现在,机器学习技术在实验粒子物理学中已建立,可以通过新的和独特的方式分析探测器数据。粒子天文台中信号的识别是一个必不可少的数据处理任务,可以使用这种方法可以改进。本文旨在探讨专门的机器学习方法可能为双相贵重气体时间投影室中信号分类提供的好处。从使用高斯混合模型的探索性数据分析中提出了一种完整的方法,以及基于神经网络和随机森林的标准实现的专用预测模型的特征至关重要的排名,并使用来自LZ实验的未标记模拟数据作为对真实数据的代理进行了验证。这项工作中开发的预测模型的全球分类准确性估计为99.0%,这是对使用相同数据测试的常规算法的改进。聚类分析的结果还用于识别由错误估计的信号属性引起的数据中的异常情况,这表明该方法也可以用于数据监视。
Machine learning techniques are now well established in experimental particle physics, allowing detector data to be analysed in new and unique ways. The identification of signals in particle observatories is an essential data processing task that can potentially be improved using such methods. This paper aims at exploring the benefits that a dedicated machine learning approach might provide to the classification of signals in dual-phase noble gas time projection chambers. A full methodology is presented, from exploratory data analysis using Gaussian mixture models and feature importance ranking to the construction of dedicated predictive models based on standard implementations of neural networks and random forests, validated using unlabelled simulated data from the LZ experiment as a proxy to real data. The global classification accuracy of the predictive models developed in this work is estimated to be >99.0%, which is an improvement over conventional algorithms tested with the same data. The results from the clustering analysis were also used to identify anomalies in the data caused by miscalculated signal properties, showing that this methodology can also be used for data monitoring.