Supercomputer simulations of structure formation in the Universe
Abstract
We describe the implementation and performance results of our massively parallel MPI†/OpenMP‡ hybrid TreePM code for large-scale cosmological N-body simulations. For domain decomposition, a recursive multi-section algorithm is used and the size of domains are automatically set so that the total calculation time is the same for all processes. We developed a highly-tuned gravity kernel for short-range forces, and a novel communication algorithm for long-range forces. For two trillion particles benchmark simulation, the average performance on the fullsystem of K computer (82,944 nodes, the total number of core is 663,552) is 5.8 Pflops, which corresponds to 55% of the peak speed.
- Publication:
-
Astroinformatics
- Pub Date:
- June 2017
- DOI:
- 10.1017/S174392131700045X
- Bibcode:
- 2017IAUS..325...10I
- Keywords:
-
- dark matter;
- halo;
- numerical