Weak scaling performance, Free benchmark, Broadwell, 1M particles/node
Performance in millions of particle-timesteps / second / node

Nodes CPU (mpi,hyper) Kokkos/OMP (mpi,hyper,thread) Kokkos/serial (mpi,hyper)
1 529 (36,1) 488.7 (18,1,2) 480 (72,2)
2 523.8 (36,1) 479.6 (18,2,4) 479.5 (36,1)
4 460.6 (36,1) 424.1 (18,2,4) 441.5 (36,1)
8 445.9 (36,1) 387 (18,2,4) 416.6 (72,2)
16 446.2 (72,2) 359.7 (18,2,4) 407.3 (72,2)
32 445.7 (36,1) 298 (18,2,4) 433.1 (36,1)
64 390.4 (36,1) 310 (18,1,2) 303.7 (72,2)

Run commands and logfile links for column CPU

1 mpiexec -np 36 -npernode 36 --oversubscribe --bind-to core ./spa_serrano_cpu -v x 64 -v y 40 -v z 40 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=cpu.kind=weak.size=1M.node=1.mpi=36.hyper=1
2 mpiexec -np 72 -npernode 36 --oversubscribe --bind-to core ./spa_serrano_cpu -v x 64 -v y 40 -v z 80 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=cpu.kind=weak.size=1M.node=2.mpi=36.hyper=1
4 mpiexec -np 144 -npernode 36 --oversubscribe --bind-to core ./spa_serrano_cpu -v x 64 -v y 80 -v z 80 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=cpu.kind=weak.size=1M.node=4.mpi=36.hyper=1
8 mpiexec -np 288 -npernode 36 --oversubscribe --bind-to core ./spa_serrano_cpu -v x 128 -v y 80 -v z 80 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=cpu.kind=weak.size=1M.node=8.mpi=36.hyper=1
16 mpiexec -np 1152 -npernode 72 --oversubscribe --bind-to core ./spa_serrano_cpu -v x 128 -v y 80 -v z 160 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=cpu.kind=weak.size=1M.node=16.mpi=72.hyper=2
32 mpiexec -np 1152 -npernode 36 --oversubscribe --bind-to core ./spa_serrano_cpu -v x 128 -v y 160 -v z 160 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=cpu.kind=weak.size=1M.node=32.mpi=36.hyper=1
64 mpiexec -np 2304 -npernode 36 --oversubscribe --bind-to core ./spa_serrano_cpu -v x 256 -v y 160 -v z 160 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=cpu.kind=weak.size=1M.node=64.mpi=36.hyper=1

Run commands and logfile links for column Kokkos/OMP

1 mpiexec -np 18 -npernode 18 --oversubscribe --bind-to socket ./spa_serrano_kokkos_omp -sf kk -k on t 2 -pk kokkos reduction parallel/reduce comm classic -v x 64 -v y 40 -v z 40 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_omp.kind=weak.size=1M.node=1.mpi=18.thread=2.hyper=1
2 mpiexec -np 36 -npernode 18 --oversubscribe --bind-to socket ./spa_serrano_kokkos_omp -sf kk -k on t 4 -pk kokkos reduction parallel/reduce comm classic -v x 64 -v y 40 -v z 80 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_omp.kind=weak.size=1M.node=2.mpi=18.thread=4.hyper=2
4 mpiexec -np 72 -npernode 18 --oversubscribe --bind-to socket ./spa_serrano_kokkos_omp -sf kk -k on t 4 -pk kokkos reduction parallel/reduce comm classic -v x 64 -v y 80 -v z 80 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_omp.kind=weak.size=1M.node=4.mpi=18.thread=4.hyper=2
8 mpiexec -np 144 -npernode 18 --oversubscribe --bind-to socket ./spa_serrano_kokkos_omp -sf kk -k on t 4 -pk kokkos reduction parallel/reduce comm classic -v x 128 -v y 80 -v z 80 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_omp.kind=weak.size=1M.node=8.mpi=18.thread=4.hyper=2
16 mpiexec -np 288 -npernode 18 --oversubscribe --bind-to socket ./spa_serrano_kokkos_omp -sf kk -k on t 4 -pk kokkos reduction parallel/reduce comm classic -v x 128 -v y 80 -v z 160 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_omp.kind=weak.size=1M.node=16.mpi=18.thread=4.hyper=2
32 mpiexec -np 576 -npernode 18 --oversubscribe --bind-to socket ./spa_serrano_kokkos_omp -sf kk -k on t 4 -pk kokkos reduction parallel/reduce comm classic -v x 128 -v y 160 -v z 160 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_omp.kind=weak.size=1M.node=32.mpi=18.thread=4.hyper=2
64 mpiexec -np 1152 -npernode 18 --oversubscribe --bind-to socket ./spa_serrano_kokkos_omp -sf kk -k on t 2 -pk kokkos reduction parallel/reduce comm classic -v x 256 -v y 160 -v z 160 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_omp.kind=weak.size=1M.node=64.mpi=18.thread=2.hyper=1

Run commands and logfile links for column Kokkos/serial

1 mpiexec -np 72 -npernode 72 --oversubscribe --bind-to core ./spa_serrano_kokkos_serial -sf kk -k on -pk kokkos reduction parallel/reduce comm classic -v x 64 -v y 40 -v z 40 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_serial.kind=weak.size=1M.node=1.mpi=72.hyper=2
2 mpiexec -np 72 -npernode 36 --oversubscribe --bind-to core ./spa_serrano_kokkos_serial -sf kk -k on -pk kokkos reduction parallel/reduce comm classic -v x 64 -v y 40 -v z 80 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_serial.kind=weak.size=1M.node=2.mpi=36.hyper=1
4 mpiexec -np 144 -npernode 36 --oversubscribe --bind-to core ./spa_serrano_kokkos_serial -sf kk -k on -pk kokkos reduction parallel/reduce comm classic -v x 64 -v y 80 -v z 80 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_serial.kind=weak.size=1M.node=4.mpi=36.hyper=1
8 mpiexec -np 576 -npernode 72 --oversubscribe --bind-to core ./spa_serrano_kokkos_serial -sf kk -k on -pk kokkos reduction parallel/reduce comm classic -v x 128 -v y 80 -v z 80 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_serial.kind=weak.size=1M.node=8.mpi=72.hyper=2
16 mpiexec -np 1152 -npernode 72 --oversubscribe --bind-to core ./spa_serrano_kokkos_serial -sf kk -k on -pk kokkos reduction parallel/reduce comm classic -v x 128 -v y 80 -v z 160 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_serial.kind=weak.size=1M.node=16.mpi=72.hyper=2
32 mpiexec -np 1152 -npernode 36 --oversubscribe --bind-to core ./spa_serrano_kokkos_serial -sf kk -k on -pk kokkos reduction parallel/reduce comm classic -v x 128 -v y 160 -v z 160 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_serial.kind=weak.size=1M.node=32.mpi=36.hyper=1
64 mpiexec -np 4608 -npernode 72 --oversubscribe --bind-to core ./spa_serrano_kokkos_serial -sf kk -k on -pk kokkos reduction parallel/reduce comm classic -v x 256 -v y 160 -v z 160 -v t 100 -in in.free.steps -log log.sparta.date=23Dec17.model=free.machine=serrano.pkg=kokkos_serial.kind=weak.size=1M.node=64.mpi=72.hyper=2