Reports (Research Report), Year: 2019

A hierarchical fast direct solver for distributed memory machines with manycore nodes

Abstract

Compression techniques have revolutionized the Boundary Element Method used to solve the Maxwell equations in the frequency domain. Despite gains of several orders of magnitude in computational cost and resource consumption, implementing them in a direct solver remains challenging, especially on distributed memory machines. We present the design of an efficient and scalable hierarchical fast direct solver capable of factorizing H-matrices on large-scale machines with manycore nodes. This task-based solver relies on a flexible execution model that extends the sequential task flow (STF) paradigm, enabling seamless expression of complex dependencies between hierarchical data over distributed memory machines. We demonstrate its efficiency and scalability by solving large-scale problems over hundreds of manycore nodes; for example, we factorize an H-matrix with 4.4 million unknowns, compressed at 99%, in less than 40 minutes with about 70% parallel efficiency over 24,320 cores.
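
To make the sequential task flow idea concrete, here is a minimal sketch of how a runtime can infer task dependencies from the order in which tasks are submitted and the data each one reads or writes. It is a toy illustration of the general STF paradigm, not the solver or runtime described in the report. The class `TaskFlow`, its `submit` method, and the block names `A00`, `A01`, `A10`, `A11` are hypothetical, and the task list only mimics one step of a 2x2 block LU factorization.

```python
from collections import defaultdict


class TaskFlow:
    """Toy STF recorder (hypothetical): infers dependencies from access modes."""

    def __init__(self):
        self.tasks = []                    # task names, in submission order
        self.deps = defaultdict(set)       # task id -> ids of tasks it must wait for
        self.last_writer = {}              # data handle -> id of the last writing task
        self.readers = defaultdict(list)   # data handle -> readers since the last write

    def submit(self, name, reads=(), writes=()):
        tid = len(self.tasks)
        self.tasks.append(name)
        self.deps[tid]  # create an (initially empty) entry for this task
        for h in reads:
            if h in self.last_writer:
                self.deps[tid].add(self.last_writer[h])  # read-after-write
        for h in writes:
            if h in self.last_writer:
                self.deps[tid].add(self.last_writer[h])  # write-after-write
            self.deps[tid].update(self.readers[h])       # write-after-read
            self.last_writer[h] = tid
            self.readers[h] = []
        for h in reads:
            if h not in writes:
                self.readers[h].append(tid)
        return tid


# Tasks are submitted in plain sequential order; no dependency is stated explicitly.
flow = TaskFlow()
t0 = flow.submit("getrf(A00)", reads=["A00"], writes=["A00"])
t1 = flow.submit("trsm(A00, A01)", reads=["A00", "A01"], writes=["A01"])
t2 = flow.submit("trsm(A00, A10)", reads=["A00", "A10"], writes=["A10"])
t3 = flow.submit("gemm(A10, A01, A11)", reads=["A10", "A01", "A11"], writes=["A11"])
t4 = flow.submit("getrf(A11)", reads=["A11"], writes=["A11"])

for tid, name in enumerate(flow.tasks):
    print(name, "waits for", sorted(flow.deps[tid]))
```

In this sketch the two `trsm` tasks depend only on the first `getrf`, so a scheduler would be free to run them concurrently, while `gemm` waits for both. A real runtime would additionally execute the tasks and, on distributed memory, move data between nodes; neither is modelled here.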
Main file
HMAT-CEA-RR.pdf (2.19 MB)
Origin: Files produced by the author(s)

Dates and versions

cea-02304706, version 1 (03-10-2019)

Identifiers

  • HAL Id: cea-02304706, version 1

Cite

Cédric Augonnet, David Goudin, Matthieu Kuhn, Xavier Lacoste, Raymond Namyst, et al. A hierarchical fast direct solver for distributed memory machines with manycore nodes. [Research Report] CEA/DAM; Total E&P; Université de Bordeaux. 2019. ⟨cea-02304706⟩