将模拟微酮事件上粒子分类的训练缩放到多个GPU

论文标题

将模拟微酮事件上粒子分类的训练缩放到多个GPU

Scaling the training of particle classification on simulated MicroBooNE events to multiple GPUs

论文作者

Hagen, Alex, Church, Eric, Strube, Jan, Bhattacharya, Kolahal, Amatya, Vinay

论文摘要

液体氩时间投影室（LARTPC）中微子检测器中的测量值，例如Fermilab的微酮检测器，具有大型，高富达事件图像。深度学习技术在照片的分类任务中非常成功，但是由于事件的范围很大，它们在LARTPC事件图像中的应用是具有挑战性的。这些检测器中的事件通常比经典挑战中的图像大两个数量级，例如对MNIST数据库中包含的手写数字的识别或Imagenet数据库中的对象识别。理想情况下，培训将在整个事件数据的许多实例上进行，而不是事件数据中裁剪的许多感兴趣区域的实例。但是，这样的努力导致了非常长的训练周期，从而减慢了对新网络体系结构和超参数扫描以提高分类性能的探索。我们介绍了在跨越多个节点的多个架构上缩放LARTPC分类问题的研究。研究是对微酮检测器中的模拟事件进行的。我们强调的是，从此处的任何结果中提取物理学，超出了这项研究的范围。已经使用了西北国家实验室和橡树岭国家实验室领导力计算设施的Summitdev机构计算。据我们所知，这是最先进的卷积神经网络用于粒子物理及其随之而来的计算技术到DOE领导班设施中的使用。我们期望在未来几十年中，旗舰美国高能物理学（HEP）计划，特别是在地下中微子实验（Dune）LARTPC计划（Dune）LARTPC计划中获得好处。

Measurements in Liquid Argon Time Projection Chamber (LArTPC) neutrino detectors, such as the MicroBooNE detector at Fermilab, feature large, high fidelity event images. Deep learning techniques have been extremely successful in classification tasks of photographs, but their application to LArTPC event images is challenging, due to the large size of the events. Events in these detectors are typically two orders of magnitude larger than images found in classical challenges, like recognition of handwritten digits contained in the MNIST database or object recognition in the ImageNet database. Ideally, training would occur on many instances of the entire event data, instead of many instances of cropped regions of interest from the event data. However, such efforts lead to extremely long training cycles, which slow down the exploration of new network architectures and hyperparameter scans to improve the classification performance. We present studies of scaling a LArTPC classification problem on multiple architectures, spanning multiple nodes. The studies are carried out on simulated events in the MicroBooNE detector. We emphasize that it is beyond the scope of this study to optimize networks or extract the physics from any results here. Institutional computing at Pacific Northwest National Laboratory and the SummitDev machine at Oak Ridge National Laboratory's Leadership Computing Facility have been used. To our knowledge, this is the first use of state-of-the-art Convolutional Neural Networks for particle physics and their attendant compute techniques onto the DOE Leadership Class Facilities. We expect benefits to accrue particularly to the Deep Underground Neutrino Experiment (DUNE) LArTPC program, the flagship US High Energy Physics (HEP) program for the coming decades.

下载PDF全文

下载文献需遵守相关版权规定

论文标题