Skip to content
Snippets Groups Projects
Commit 4ed2338f authored by Moe Jette's avatar Moe Jette
Browse files

Update architecture map to show hierarchical slurmd communications.

parent 296b022e
No related branches found
No related tags found
No related merge requests found
doc/html/arch.gif

26.6 KiB | W: | H:

doc/html/arch.gif

28.1 KiB | W: | H:

doc/html/arch.gif
doc/html/arch.gif
doc/html/arch.gif
doc/html/arch.gif
  • 2-up
  • Swipe
  • Onion skin
...@@ -23,7 +23,9 @@ HP distributes and supports SLURM as a component in their XC System Software.</p ...@@ -23,7 +23,9 @@ HP distributes and supports SLURM as a component in their XC System Software.</p
work. There may also be a backup manager to assume those responsibilities in the work. There may also be a backup manager to assume those responsibilities in the
event of failure. Each compute server (node) has a <b>slurmd</b> daemon, which event of failure. Each compute server (node) has a <b>slurmd</b> daemon, which
can be compared to a remote shell: it waits for work, executes that work, returns can be compared to a remote shell: it waits for work, executes that work, returns
status, and waits for more work. User tools include <b>srun</b> to initiate jobs, status, and waits for more work.
The <b>slurmd</b> daemons provide fault-tolerant hierarchical communciations.
User tools include <b>srun</b> to initiate jobs,
<b>scancel</b> to terminate queued or running jobs, <b>sinfo</b> to report system <b>scancel</b> to terminate queued or running jobs, <b>sinfo</b> to report system
status, and <b>squeue</b> to report the status of jobs. status, and <b>squeue</b> to report the status of jobs.
The <b>smap</b> and <b>sview</b> commands graphically reports system and The <b>smap</b> and <b>sview</b> commands graphically reports system and
......
...@@ -17,8 +17,9 @@ work.</p> ...@@ -17,8 +17,9 @@ work.</p>
<h2>Architecture</h2> <h2>Architecture</h2>
<p>As depicted in Figure 1, SLURM consists of a <b>slurmd</b> daemon running on <p>As depicted in Figure 1, SLURM consists of a <b>slurmd</b> daemon running on
each compute node and a central <b>slurmctld</b> daemon running on a management node each compute node and a central <b>slurmctld</b> daemon running on a management node
(with optional fail-over twin). The user commands include: <b>srun</b>, (with optional fail-over twin).
<b>sbcast</b>, <b>scancel</b>, The <b>slurmd</b> daemons provide fault-tolerant hierarchical communciations.
The user commands include: <b>srun</b>, <b>sbcast</b>, <b>scancel</b>,
<b>sinfo</b>, <b>srun</b>, <b>smap</b>, <b>squeue</b>, and <b>scontrol</b>. <b>sinfo</b>, <b>srun</b>, <b>smap</b>, <b>squeue</b>, and <b>scontrol</b>.
All of the commands can run anywhere in the cluster.</p> All of the commands can run anywhere in the cluster.</p>
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment