Run Calculations
To run calculations on the cluster, users need to have
the code they plan to run
any input it requires
a way to sub the code using a batch scheduler
Managing Jobs
HPC utilizes SLURM to manage jobs that users submit to various queues on a computer system. Each queue represents a group of resources with attributes necessary for the queue's jobs. You can see the list of queues that HPC has by typing sinfo
. stdmemq is the default partition/queue.
Common Commands
The table below gives a short description of the most used SLURM commands.
Note: Do not run jobs on the login nodes. All jobs launched from those nodes will be terminated without notice.
Listing jobs
To list all jobs:
To list your jobs:
To obtain the status of a job, run the following command using the job's ID number (this is provided at time of job submission).
You can also use checkjob job_ID
to show the current status of the job.
Submitting a job
To submit a job, use the sbatch
command, followed by the name of your submission file. A Job ID will be provided. You may want to make note of the ID for later use.
Deleting a job
Note: Be aware that deleting a job cannot be undone. Double check the job ID before deleting a job.
Users can delete their jobs by typing the following command.
To delete all the jobs of a user:
Overview of resources
The sinfo
command gives an overview of what resources are in each partition/queue and what their status is. It should inform your decisions on how you structure your jobs and what partition you should submit them to.
You can format that output in a more concise form:
Status of past and current jobs
The sacct
command gives some accounting details on past and current jobs.
You can format that output in a more detailed form:
SLURM environmental variables
When a SLURM job is scheduled to run, some relevant information about the job such as the names of the nodes it is running on, the number of cores, the working directory ... etc ... are saved as environmental variables. Users can invoke these environmental variables in their job submission scripts.
Below is a list of the most common SLURM environmental variables including with a brief description from UMD's HPC page.
Last updated