论文标题
基于MPI-3共享内存的Euler-Lagrange模拟的混合平行化
Hybrid Parallelization of Euler-Lagrange Simulations Based on MPI-3 Shared Memory
论文作者
论文摘要
在非结构化网格上使用Euler-Lagrange方法将其应用区域扩展到更通用的设置。但是,缺乏常规拓扑限制了分布式并行方法的可扩展性,尤其是对于在空间中进行物理搜索的例程。最突出的放缓之一是在物理空间中寻找光环元素,以避免运行时通信。在这项工作中,我们提出了一种使用MPI-3共享内存模型的新的无通信晕元素搜索算法。与分布式并行化方法相比,这种新颖的方法消除了初始化过程中多到多次通信的严重性能瓶颈,并将可能的应用扩展到以前方法可实现的应用之外。然后,在这些数据结构的基础上,我们提出了有效粒子发射的方法,可扩展的沉积方案用于颗粒磁场耦合以及潜伏期隐藏方法。通过在大规模并行系统上的开源框架的等离子动力学模拟来验证所提出算法的缩放性能,证明131000个内核的效率高达80%。
The use of Euler-Lagrange methods on unstructured grids extends their application area to more versatile setups. However, the lack of a regular topology limits the scalability of distributed parallel methods, especially for routines that perform a physical search in space. One of the most prominent slowdowns is the search for halo elements in physical space for the purpose of runtime communication avoidance. In this work, we present a new communication-free halo element search algorithm utilizing the MPI-3 shared memory model. This novel method eliminates the severe performance bottleneck of many-to-many communication during initialization compared to the distributed parallelization approach and extends the possible applications beyond those achievable with the previous approach. Building on these data structures, we then present methods for efficient particle emission, scalable deposition schemes for particle-field coupling, and latency hiding approaches. The scaling performance of the proposed algorithms is validated through plasma dynamics simulations of an open-source framework on a massively parallel system, demonstrating an efficiency of up to 80% on 131000 cores.