AMBER
Examples at https://github.com/hpc-cofc/example-runs/tree/master/chemistry/AMBER
AMBER 18
Idea
This calculation simulates a 136475-atom biological molecule for 100,000 time steps (0.2 nanoseconds). It also reports how long of a simulation we can run in a day if we were to run it in
one CPU-only node (
one-node
) with 40 cores = 40 MPI processestwo CPU-only nodes (
two-nodes
) with 80 cores in total = 80 MPI processesone node with NVIDIA Quadro P4000 GPU (
gpu-quadro-p4000
) = 1 process on CPUs with massive processing on GPUone node with NVIDIA Tesla V100 GPU (
gpu-tesla-v100
) = 1 process on CPUs with massive processing on GPU
It demonstrates
the power parallelization using MPI
compare calculations on a single node (smp) vs. multiple nodes (mpi)
the power of GPUs
compare calculations on CPUs (smp, mpi) vs. GPUs (p4000, v100)
different between consumer-grade and datacenter GPUs
compare consumer-grade (NVidia Quadro P4000) and datacenter (NVIDIA Tesla V100) GPU
How to run
On the CofC HPC cluster, you can go into the one-node
, two-nodes
, gpu-quadro-p4000
and gpu-tesla-v100
directories and enter sbatch run.slurm
Results
Here is a summary from a sample run.
Calculation | Number of MPI processes | GPUs? | Speed of simulation (ns/day) [higher is better] |
./single-node/40.out: | 40 | - | 9.17 |
./two-nodes/80.out: | 80 | - | 14.49 |
./gpu-quadro-p4000/quadro.out: | 1 | 1 | 31.45 |
./gpu-tesla-v100/24.out: | 1 | 1 | 116.84 |
Last updated