A dissection solver with kernel detection for symmetric finite element matrices on shared memory computers - Sorbonne Université
Journal article. International Journal for Numerical Methods in Engineering. Year: 2014

Abstract

A direct solver for symmetric sparse matrices arising from finite element problems is presented. The solver is intended to serve as the local solver in domain decomposition methods for hybrid parallelization on clusters of multi-core CPUs, so it must run on shared memory computers and be able to detect the kernel of a matrix. A matrix is factorized by symmetric pivoting with a given threshold, using a decomposition obtained by nested bisection, and the threshold is used to select suspicious null pivots. The Schur complement constructed from the suspicious null pivots is examined by a factorization with 1x1 and 2x2 pivoting and by a robust kernel detection algorithm based on measuring residuals with orthogonal projections onto supposed image spaces. A static data structure derived from the nested bisection, together with a block substructure for the Schur complements at all bisection levels, allows efficient use of level 3 BLAS routines. Asynchronous task execution for each block drastically reduces processor idle time, and as a result the solver achieves high parallel efficiency. Numerical experiments show that the developed solver is competitive with Intel Pardiso on shared memory computers.
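
To make the idea of postponing suspicious null pivots concrete, the following is a minimal dense sketch in plain Python. It is not the solver described in the article: it uses diagonal (1x1) symmetric pivoting only, has no nested bisection, no 2x2 pivots, and no projection-based kernel test. The function name `ldlt_with_postponed_pivots`, the parameter `tau`, and the rule "stop when a pivot is smaller than `tau` times the previous pivot" are all assumptions made for illustration.

```python
def ldlt_with_postponed_pivots(a, tau=1e-8):
    """Eliminate a dense symmetric matrix with diagonal pivoting.

    Hypothetical helper, for illustration only. At each step the largest
    remaining diagonal entry is used as the pivot. Elimination stops as
    soon as the pivot is exactly zero or smaller than tau times the
    previous pivot (an assumed threshold rule); the indices left over are
    reported as suspicious null pivots, together with the Schur
    complement restricted to them.
    """
    s = [row[:] for row in a]          # working copy, overwritten in place
    remaining = list(range(len(a)))    # indices not yet eliminated
    accepted = []                      # pivot indices in elimination order
    prev = None                        # value of the previously accepted pivot

    while remaining:
        k = max(remaining, key=lambda i: abs(s[i][i]))
        piv = s[k][k]
        too_small = prev is not None and abs(piv) < tau * abs(prev)
        if piv == 0.0 or too_small:
            break                      # postpone everything that is left
        remaining.remove(k)
        # Rank-one update of the trailing block (Schur complement).
        for i in remaining:
            f = s[i][k] / piv
            for j in remaining:
                s[i][j] -= f * s[k][j]
        accepted.append(k)
        prev = piv

    schur = [[s[i][j] for j in remaining] for i in remaining]
    return accepted, remaining, schur


def is_numerically_zero(block, scale, tol=1e-12):
    """Naive stand-in for a kernel test: all entries tiny relative to scale."""
    return all(abs(v) <= tol * scale for row in block for v in row)


# Stiffness-like matrix of a 1D chain with both ends free (a "floating"
# subdomain): constant vectors lie in its kernel.
A = [
    [1.0, -1.0, 0.0, 0.0, 0.0],
    [-1.0, 2.0, -1.0, 0.0, 0.0],
    [0.0, -1.0, 2.0, -1.0, 0.0],
    [0.0, 0.0, -1.0, 2.0, -1.0],
    [0.0, 0.0, 0.0, -1.0, 1.0],
]

accepted, suspicious, schur = ldlt_with_postponed_pivots(A)
print("accepted pivots:", accepted)
print("suspicious null pivots:", suspicious)
print("Schur complement on suspicious set:", schur)
print("looks singular:", is_numerically_zero(schur, scale=2.0))
```

In this toy case the Schur complement on the postponed index vanishes, which is consistent with a one-dimensional kernel. A zero test on individual entries is fragile in general, which is presumably why the abstract describes a further factorization with 1x1 and 2x2 pivoting and a residual-based kernel detection step for that Schur complement.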
Main file
SuzukiRoux-Apr2014.pdf (1.69 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-00816916 , version 1 (23-04-2013)
hal-00816916 , version 2 (30-10-2013)
hal-00816916 , version 3 (04-04-2014)

Cite

Atsushi Suzuki, François-Xavier Roux. A dissection solver with kernel detection for symmetric finite element matrices on shared memory computers. International Journal for Numerical Methods in Engineering, 2014, 100 (2), pp.136-164. ⟨10.1002/nme.4729⟩. ⟨hal-00816916v3⟩