Skip to content
Snippets Groups Projects
Commit 9c91142c authored by Mark Grondona's avatar Mark Grondona
Browse files

o added arch.png and entities.png

parent b21c67ef
No related branches found
No related tags found
No related merge requests found
doc/html/arch.png

52.5 KiB

doc/html/entities.png

9.67 KiB

...@@ -7,19 +7,18 @@ ...@@ -7,19 +7,18 @@
<h2>Overview</h2> <h2>Overview</h2>
Simple Linux Utility for Resource Management (SLURM) is an open source, The Simple Linux Utility for Resource Management (SLURM) is an open
fault-tolerant, and highly scalable cluster management and job source, fault-tolerant, and highly scalable cluster management and job
scheduling system for Linux clusters large and small. scheduling system for Linux clusters large and small. SLURM requires
SLURM requires no kernel modifications for it operation and is no kernel modifications for it operation and is relatively self-contained.
relatively self-contained.
As a cluster resource manager, SLURM has three key functions. First, As a cluster resource manager, SLURM has three key functions. First,
it allocates exclusive and/or non-exclusive access to resources it allocates exclusive and/or non-exclusive access to resources (compute
(compute nodes) to users for nodes) to users for some duration of time so they can perform work.
some duration of time so they can perform work. Second, it provides Second, it provides a framework for starting, executing, and monitoring
a framework for starting, executing, and monitoring work (normally a work (normally a parallel job) on the set of allocated nodes. Finally,
parallel job) on the set of allocated nodes. Finally, it arbitrates it arbitrates conflicting requests for resources by managing a queue of
conflicting requests for resources by managing a queue of pending work. pending work.
<h2>Architecture</h2> <h2>Architecture</h2>
...@@ -220,33 +219,26 @@ SLURM logs from multiple nodes. ...@@ -220,33 +219,26 @@ SLURM logs from multiple nodes.
<h3>Configuration</h3> <h3>Configuration</h3>
The SLURM configuration file includes a wide variety of parameters. The SLURM configuration file includes a wide variety of
A full description of the parameters is included in the <i>slurm.conf</i> parameters. A full description of the parameters is included in the
man page. <i>slurm.conf</i> man page. Rather than duplicate that information,
Rather than duplicate that information, a sample configuration file a sample configuration file is shown below. Any text following a
is shown below. "#" is considered a comment. The keywords in the file are not case
Any text following a "#" is considered a comment. sensitive, although the argument typically is (e.g. "SlurmUser=slurm"
The keywords in the file are not case sensitive, might be specified as "slurmuser=slurm"). The control machine, like
although the argument typically is (e.g. "SlurmUser=slurm" all other machine specifications can include both the host name and
might be specified as "slurmuser=slurm"). the name used for communications. In this case, the host's name is
The control machine, like all other machine specifications can "mcri" and the name "emcri" is used for communications. The "e" prefix
include both the host name and the name used for communications. identifies this as an ethernet address at this site. Port numbers to be
In this case, the host's name is "mcri" and the name "emcri" is used for communications are specified as well as various timer values.
used for communications. The "e" prefix identifies this as an On DPCS systems set FirstJobId to 65536 or higher. This will permit
ethernet address at this site. DPCS to specify a SLURM job id to match its own job id without conflicts
Port numbers to be used for communications are specified as from jobs submitted to SLURM by other means. <p> A description of the
well as various timer values. nodes and their grouping into non-overlapping partitions is required.
On DPCS systems set FirstJobId to 65536 or higher. Partition and node specifications use node range expressions to identify
This will permit DPCS to specify a SLURM job id to match its own job id nodes in a concise fashion. This configuration file defines a 1154 node
without conflicts from jobs submitted to SLURM by other means. cluster for SLURM, but might be used for a much larger cluster by just
<p> changing a few node range expressions.
A description of the nodes and their grouping into non-overlapping
partitions is required.
Partition and node specifications use node range expressions to
identify nodes in a concise fashion.
This configuration file defines a 1154 node cluster for SLURM, but
might be used for a much larger cluster by just changing a
few node range expressions.
<pre> <pre>
# #
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment