
Spring clean: Remove partition; use cluster

Merged: Martin Schroschk requested to merge `issue-583` into `preview`
@@ -8,8 +8,8 @@ depend on the type of parallelization and architecture.
### OpenMP Jobs
An SMP-parallel job can only run within a node, so it is necessary to include the options `--nodes=1`
-and `--ntasks=1`. The maximum number of processors for an SMP-parallel program is 896 on
-partition `taurussmp8`, as described in the
+and `--ntasks=1`. The maximum number of processors for an SMP-parallel program is 896 on the cluster
+[`Julia`](julia.md), as described in the
[section on memory limits](slurm_limits.md#slurm-resource-limits-table). Using the option
`--cpus-per-task=<N>` Slurm will start one task and you will have `N` CPUs available for your job.
An example job file would look like:
@@ -22,8 +22,7 @@ An example job file would look like:
#SBATCH --tasks-per-node=1
#SBATCH --cpus-per-task=8
#SBATCH --time=08:00:00
#SBATCH --job-name=Science1
-#SBATCH --mail-type=end
+#SBATCH --mail-type=start,end
#SBATCH --mail-user=<your.email>@tu-dresden.de
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK
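For orientation, a complete job file for this OpenMP case could look like the following sketch. The shebang and the final `srun` line are not part of the hunk above, and `./path/to/smp_binary` is a placeholder for your application.

```Bash
#!/bin/bash
#SBATCH --nodes=1
#SBATCH --tasks-per-node=1
#SBATCH --cpus-per-task=8
#SBATCH --time=08:00:00
#SBATCH --job-name=Science1
#SBATCH --mail-type=start,end
#SBATCH --mail-user=<your.email>@tu-dresden.de

# make the allocated cores available to the OpenMP runtime
export OMP_NUM_THREADS=$SLURM_CPUS_PER_TASK

# placeholder for your SMP-parallel application
srun ./path/to/smp_binary
```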
@@ -131,10 +130,6 @@ where `NUM_PER_NODE` is the number of GPUs **per node** that will be used for th
srun ./your/cuda/application # start your application (probably requires MPI to use both nodes)
```
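The hunk only shows the final `srun` line of that job file. As a rough sketch (node count, walltime, and the application path are assumptions), the surrounding job file might look like:

```Bash
#!/bin/bash
#SBATCH --nodes=2                  # assumption: use two GPU nodes
#SBATCH --ntasks-per-node=1
#SBATCH --gres=gpu:NUM_PER_NODE    # number of GPUs per node, see the text above
#SBATCH --time=01:00:00

srun ./your/cuda/application       # start your application (probably requires MPI to use both nodes)
```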
-With the transition to the sub-clusters it is no longer required to specify the partition with `-p, --partition`.
-It can still be used and will lead to a failure when submitting the job on the wrong cluster.
-This is useful to document the cluster used or avoid accidentally using the wrong SBATCH script.
!!! note
Due to an unresolved issue concerning the Slurm job scheduling behavior, it is currently not
@@ -209,10 +204,10 @@ three things:
1. Allocate enough resources to accommodate multiple instances of our program. This can be achieved
with an appropriate batch script header (see below).
-1. Start job steps with srun as background processes. This is achieved by adding an ampersand at the
-end of the `srun` command
+1. Start job steps with `srun` as background processes. This is achieved by adding an ampersand at
+the end of the `srun` command.
1. Make sure that each background process gets its private resources. We need to set the resource
-fraction needed for a single run in the corresponding srun command. The total aggregated
+fraction needed for a single run in the corresponding `srun` command. The total aggregated
resources of all job steps must fit in the allocation specified in the batch script header.
Additionally, the option `--exclusive` is needed to make sure that each job step is provided with
its private set of CPU and GPU resources. The following example shows how four independent
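The example referred to here lies outside this hunk. A rough sketch of the pattern (binary names, resource numbers, and walltime are assumptions) could look like:

```Bash
#!/bin/bash
#SBATCH --ntasks=4
#SBATCH --cpus-per-task=1
#SBATCH --time=01:00:00
#SBATCH --job-name=independent_steps

# each job step gets a private share of the allocation (--exclusive) and is
# put into the background with '&'
srun --exclusive --ntasks=1 ./instance_a &
srun --exclusive --ntasks=1 ./instance_b &
srun --exclusive --ntasks=1 ./instance_c &
srun --exclusive --ntasks=1 ./instance_d &

# wait until all background job steps have finished before the batch job ends
wait
```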
@@ -254,40 +249,40 @@ enough resources in total were specified in the header of the batch script.
## Exclusive Jobs for Benchmarking
-Jobs ZIH systems run, by default, in shared-mode, meaning that multiple jobs (from different users)
-can run at the same time on the same compute node. Sometimes, this behavior is not desired (e.g.
-for benchmarking purposes). Thus, the Slurm parameter `--exclusive` request for exclusive usage of
-resources.
-Setting `--exclusive` **only** makes sure that there will be **no other jobs running on your nodes**.
-It does not, however, mean that you automatically get access to all the resources which the node
-might provide without explicitly requesting them, e.g. you still have to request a GPU via the
-generic resources parameter (`gres`) to run on the partitions with GPU, or you still have to
-request all cores of a node if you need them. CPU cores can either to be used for a task
-(`--ntasks`) or for multi-threading within the same task (`--cpus-per-task`). Since those two
-options are semantically different (e.g., the former will influence how many MPI processes will be
-spawned by `srun` whereas the latter does not), Slurm cannot determine automatically which of the
-two you might want to use. Since we use cgroups for separation of jobs, your job is not allowed to
-use more resources than requested.*
-If you just want to use all available cores in a node, you have to specify how Slurm should organize
-them, like with `--partition=haswell --cpus-per-tasks=24` or `--partition=haswell --ntasks-per-node=24`.
+Jobs on ZIH systems run, by default, in shared-mode, meaning that multiple jobs (from different
+users) can run at the same time on the same compute node. Sometimes, this behavior is not desired
+(e.g. for benchmarking purposes). You can request exclusive usage of resources using the Slurm
+parameter `--exclusive`.
+!!! note "Exclusive does not allocate all available resources"
+    Setting `--exclusive` **only** makes sure that there will be **no other jobs running on your
+    nodes**. It does not, however, mean that you automatically get access to all the resources
+    which the node might provide without explicitly requesting them.
+    E.g. you still have to request a GPU via the generic resources parameter (`gres`) on the GPU
+    cluster. On the other hand, you also have to request all cores of a node if you need them.
+CPU cores can either be used for a task (`--ntasks`) or for multi-threading within the same task
+(`--cpus-per-task`). Since those two options are semantically different (e.g., the former will
+influence how many MPI processes will be spawned by `srun` whereas the latter does not), Slurm
+cannot determine automatically which of the two you might want to use. Since we use cgroups for
+separation of jobs, your job is not allowed to use more resources than requested.
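To make this distinction concrete, here are two minimal header fragments, assuming a node with 16 cores: both occupy all cores, but the first lets `srun` launch 16 processes, while the second launches one process that may run 16 threads.

```Bash
# variant 1: 16 cores as 16 tasks (e.g. 16 MPI ranks started by srun)
#SBATCH --ntasks-per-node=16
#SBATCH --cpus-per-task=1

# variant 2: 16 cores for a single task (e.g. one process with 16 OpenMP threads)
#SBATCH --ntasks-per-node=1
#SBATCH --cpus-per-task=16
```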
Here is a short example to ensure that a benchmark is not spoiled by other jobs, even if it doesn't
-use up all resources in the nodes:
+use up all resources of the nodes:
!!! example "Exclusive resources"
!!! example "Job file with exclusive resources"
```Bash
#!/bin/bash
#SBATCH --partition=haswell
#SBATCH --nodes=2
#SBATCH --ntasks-per-node=2
#SBATCH --cpus-per-task=8
#SBATCH --exclusive # ensure that nobody spoils my measurement on 2 x 2 x 8 cores
#SBATCH --time=00:10:00
-#SBATCH --job-name=Benchmark
-#SBATCH --mail-type=end
+#SBATCH --job-name=benchmark
+#SBATCH --mail-type=start,end
#SBATCH --mail-user=<your.email>@tu-dresden.de
srun ./my_benchmark