GPU加速自适应FSAI预处理，用于大规模平行模拟

论文标题

GPU加速自适应FSAI预处理，用于大规模平行模拟

A GPU-accelerated adaptive FSAI preconditioner for massively parallel simulations

论文作者

Isotton, Giovanni, Janna, Carlo, Bernaschi, Massimo

论文摘要

方程式线性系统的解决方案是许多科学和工程应用中的核心任务。在许多情况下，线性系统的解决方案可能需要大部分仿真时间，从而代表了科学和技术软件的进一步开发中的主要瓶颈。对于大规模模拟，如今占数百万甚至数十亿个未知数，诉诸于预处理的迭代求解器非常常见，以利用其低内存需求以及至少潜在的并行性。在各种情况下，近似反相已被证明是鲁棒和有效的预处理。在这项工作中，我们展示了自适应FSAI是如何在配备GPU加速器配备的分布式内存计算机上成功实现的大约具有高度并行性的近相。在自适应FSAI设置中利用GPU并不是一项琐碎的任务，但是我们通过广泛的数值实验表明，所提出的方法在挑战性的线性代数问题中表现出更多的传统预处理，并导致近距离行为。

The solution of linear systems of equations is a central task in a number of scientific and engineering applications. In many cases the solution of linear systems may take most of the simulation time thus representing a major bottleneck in the further development of scientific and technical software. For large scale simulations, nowadays accounting for several millions or even billions of unknowns, it is quite common to resort to preconditioned iterative solvers for exploiting their low memory requirements and, at least potential, parallelism. Approximate inverses have been shown to be robust and effective preconditioners in various contexts. In this work, we show how adaptive FSAI, an approximate inverse characterized by a very high degree of parallelism, can be successfully implemented on a distributed memory computer equipped with GPU accelerators. Taking advantage of GPUs in adaptive FSAI set-up is not a trivial task, nevertheless we show through an extensive numerical experimentation how the proposed approach outperforms more traditional preconditioners and results in a close-to-ideal behaviour in challenging linear algebra problems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题