N. Arora, A. Shringarpure, and R. Vuduc, Direct N-body Kernels for Multicore Platforms, 2009 International Conference on Parallel Processing, pp.379-387, 2009.
DOI : 10.1109/ICPP.2009.71

J. E. Barnes and P. Hut, A hierarchical O(N log N) force-calculation algorithm, Nature, vol.6, issue.6096, pp.446-449, 1986.
DOI : 10.1038/324446a0

J. Bédorf, E. Gaburov, and S. P. Zwart, A sparse octree gravitational N-body code that runs entirely on the GPU processor, Journal of Computational Physics, vol.231, issue.7, pp.2825-2839, 2012.
DOI : 10.1016/j.jcp.2011.12.024

M. Burtscher and K. Pingali, An Efficient CUDA Implementation of the Tree-Based Barnes Hut n-Body Algorithm. GPU computing Gems Emerald edition, p.75, 2011.

H. Cheng, L. Greengard, and V. Rokhlin, A Fast Adaptive Multipole Algorithm in Three Dimensions, Journal of Computational Physics, vol.155, issue.2, pp.468-498, 1999.
DOI : 10.1006/jcph.1999.6355

W. Dehnen, A Hierarchical (N) Force Calculation Algorithm, Journal of Computational Physics, vol.179, issue.1, pp.27-42, 2002.
DOI : 10.1006/jcph.2002.7026

W. Dehnen, A fast multipole method for stellar dynamics, Computational Astrophysics and Cosmology, vol.111, issue.7, 2014.
DOI : 10.1186/s40668-014-0001-7

P. Fortin and J. Lamotte, Fast Multipole Method on the Cell B.E.: the Near Field Part, Int. Parallel Computing Conf. (ParCo), pp.323-330, 2009.

P. Fortin, E. Athanassoula, L. , and J. , -body simulations, Astronomy & Astrophysics, vol.531, p.120, 2011.
DOI : 10.1051/0004-6361/201015933

URL : https://hal.archives-ouvertes.fr/halsde-00908669

P. Londrillo, C. Nipoti, and L. Ciotti, A parallel implementation of a new fast algorithm for N-body simulations, Comp. astro. in Italy: methods and tools, 2002.

M. Pharr and W. R. Mark, ispc: A SPMD compiler for high-performance CPU programming, 2012 Innovative Parallel Computing (InPar), pp.1-13, 2012.
DOI : 10.1109/InPar.2012.6339601

V. Springel, The cosmological simulation code GADGET-2, Monthly Notices of the Royal Astronomical Society, vol.364, issue.4, pp.1105-1134, 2005.
DOI : 10.1111/j.1365-2966.2005.09655.x

K. Taura, J. Nakashima, R. Yokota, and N. Maruyama, A Task Parallel Implementation of Fast Multipole Methods, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.617-625, 2012.
DOI : 10.1109/SC.Companion.2012.86

M. S. Warren and J. K. Salmon, A parallel hashed Oct-Tree N-body algorithm, Proceedings of the 1993 ACM/IEEE conference on Supercomputing , Supercomputing '93, pp.12-21, 1993.
DOI : 10.1145/169627.169640

R. Yokota, An FMM Based on Dual Tree Traversal for Many-core Architectures, Journal of Algorithms and Computational Technology, vol.7, issue.3, pp.301-324, 2013.