Parallel dual tree traversal on multi-core and many-core architectures for astrophysical N-body simulations

Benoit Lange; Pierre Fortin

doi:10.1007/978-3-319-09873-9_60

Communication Dans Un Congrès Année : 2014

Parallel dual tree traversal on multi-core and many-core architectures for astrophysical N-body simulations

(1, 2) , (2)

1
2

Benoit Lange

Fonction : Auteur
PersonId : 2274
IdHAL : lange
ORCID : 0009-0007-7820-8249
IdRef : 164843876

Institut des Sciences du Calcul et des Données

Performance et Qualité des Algorithmes Numériques

Pierre Fortin

Fonction : Auteur
PersonId : 2113
IdHAL : pierre-fortin
ORCID : 0000-0003-3117-9122
IdRef : 11411255X

Performance et Qualité des Algorithmes Numériques

Résumé

In astrophysical N-body simulations, Dehnen's algorithm, implemented in the serial falcON code and based on a dual tree traversal, is faster than serial Barnes-Hut tree-codes, but outperformed by parallel CPU and GPU tree-codes. In this paper, we present a parallel dual tree traversal, implemented in the pfalcON code, targeting multi-core CPUs and many- core architectures (Xeon Phi). We focus here on both performance and portability, while preserving Dehnen's original algorithm. We first use task parallelism, with either OpenMP or Intel TBB, for the dual tree traversal. We then rely on the SPMD (single-program, multiple- data) model for the SIMD vectorization of the near field part thanks to the Intel SPMD Program Compiler. We compare the pfalcON performance to related work, and finally obtain performance results that match one of the best current tree-code implementations on GPU.

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

RR_hal-00947130_V2.pdf (578.67 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Benoit Lange : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-00947130

Soumis le : vendredi 30 mai 2014-14:34:38

Dernière modification le : mardi 11 avril 2023-15:16:28

Archivage à long terme le : samedi 30 août 2014-10:44:24

Dates et versions

hal-00947130 , version 1 (14-02-2014)

hal-00947130 , version 2 (30-05-2014)

Identifiants

HAL Id : hal-00947130 , version 2
DOI : 10.1007/978-3-319-09873-9_60

Citer

Benoit Lange, Pierre Fortin. Parallel dual tree traversal on multi-core and many-core architectures for astrophysical N-body simulations. 20th International Conference Euro-Par 2014 Parallel Processing, Aug 2014, Porto, Portugal. pp.716-727, ⟨10.1007/978-3-319-09873-9_60⟩. ⟨hal-00947130v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS LIP6 ICS SORBONNE-UNIVERSITE SU-SCIENCES FED-3 ISCD

1018 Consultations

1034 Téléchargements

Parallel dual tree traversal on multi-core and many-core architectures for astrophysical N-body simulations

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager