Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
S
Slurm
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
tud-zih-energy
Slurm
Commits
e5928562
Commit
e5928562
authored
18 years ago
by
Moe Jette
Browse files
Options
Downloads
Patches
Plain Diff
svn merge -r9565:9590
https://eris.llnl.gov/svn/slurm/branches/slurm-1.1
parent
fd7bbc72
No related branches found
Branches containing commit
No related tags found
No related merge requests found
Changes
4
Hide whitespace changes
Inline
Side-by-side
Showing
4 changed files
NEWS
+1
-1
1 addition, 1 deletion
NEWS
doc/html/moab.shtml
+40
-21
40 additions, 21 deletions
doc/html/moab.shtml
doc/html/quickstart_admin.shtml
+6
-6
6 additions, 6 deletions
doc/html/quickstart_admin.shtml
doc/html/slurm.shtml
+1
-0
1 addition, 0 deletions
doc/html/slurm.shtml
with
48 additions
and
28 deletions
NEWS
+
1
−
1
View file @
e5928562
...
@@ -36,7 +36,7 @@ documents those changes that are of interest to users and admins.
...
@@ -36,7 +36,7 @@ documents those changes that are of interest to users and admins.
-- Moved task layout to slurmctld instead of srun. Job step create returns
-- Moved task layout to slurmctld instead of srun. Job step create returns
step_layout structure with hostnames and addresses that corrisponds
step_layout structure with hostnames and addresses that corrisponds
to those nodes.
to those nodes.
--
c
hanged api slurm_lookup_allocation params,
--
C
hanged api slurm_lookup_allocation params,
resource_allocation_response_msg_t changed to job_alloc_info_response_msg_t
resource_allocation_response_msg_t changed to job_alloc_info_response_msg_t
this structure is being renamed so contents are the same.
this structure is being renamed so contents are the same.
-- alter resource_allocation_response_msg_t see slurm.h.in
-- alter resource_allocation_response_msg_t see slurm.h.in
...
...
This diff is collapsed.
Click to expand it.
doc/html/moab.shtml
+
40
−
21
View file @
e5928562
...
@@ -6,8 +6,8 @@
...
@@ -6,8 +6,8 @@
beyond the scope of any documents we could supply with SLURM.
beyond the scope of any documents we could supply with SLURM.
The best resource for Moab configuration information is the
The best resource for Moab configuration information is the
online documents at Cluster Resources Inc.:
online documents at Cluster Resources Inc.:
<a href="http://www.clusterresources.com/p
ages/resources/documentation/grid/admin.php
">
<a href="http://www.clusterresources.com/p
roducts/mwm/docs/slurmintegration.shtml
">
http://www.clusterresources.com/p
ages/resources/documentation/grid/admin.php
</a>.
http://www.clusterresources.com/p
roducts/mwm/docs/slurmintegration.shtml
</a>.
<h2>Configuration</h2>
<h2>Configuration</h2>
<p>First, download the Moab scheduler kit from their web site
<p>First, download the Moab scheduler kit from their web site
...
@@ -28,21 +28,17 @@ with the <i>sched/wiki</i> plugin (both would try to schedule the
...
@@ -28,21 +28,17 @@ with the <i>sched/wiki</i> plugin (both would try to schedule the
individual processors within a node).</p>
individual processors within a node).</p>
<p>SLURM's wiki configuration is stored in a file
<p>SLURM's wiki configuration is stored in a file
specific to the wiki-plugin. This file should be protected
specific to the wiki-plugin named <i>wiki.conf</i>.
from reading by users. It only needs to be readable by
This file should be protected from reading by users.
<i>SlurmUser</i> (as configured in <i>slurm.conf</i>) and
It only needs to be readable by <i>SlurmUser</i> (as configured
only needs to exist on computers where the <i>slurmctld</i>
in <i>slurm.conf</i>) and only needs to exist on computers
daemon executes.</p>
where the <i>slurmctld</i> daemon executes.
More information about wiki.conf is available in
a man page distributed with SLURM.</p>
<p>The currently supported keywords include:<br>
<p>The currently supported wiki.conf keywords include:</p>
<dt><i>AuthKey</i>
<dd>an encryption key for communctions between
SLURM and Maui or Moab
<dt><i>JobPriority</i>
<dd>controls execution of newly arriving jobs in SLURM
</dt></p>
<p><
i
>AuthKey</
i
> is a DES based encryption key used to sign
<p><
b
>AuthKey</
b
> is a DES based encryption key used to sign
communctions between SLURM and Maui or Moab.
communctions between SLURM and Maui or Moab.
This use of this key is essential to insure that a user
This use of this key is essential to insure that a user
not build his own program to cancel other user's jobs in
not build his own program to cancel other user's jobs in
...
@@ -54,8 +50,24 @@ configure line) or Moab (<i>KEY</i> parameter in the
...
@@ -54,8 +50,24 @@ configure line) or Moab (<i>KEY</i> parameter in the
Note that SLURM's wiki plugin does not include a mechanism
Note that SLURM's wiki plugin does not include a mechanism
to submit new jobs, so even without this key nobody could
to submit new jobs, so even without this key nobody could
run jobs as another user.</p>
run jobs as another user.</p>
<p><i>JobPriority</i> controls the scheduling of newly arriving
<p><b>EPort</b> is an event notification port in Moab.
When a job is submitted to or terminates in SLURM,
Moab is sent a message on this port to begin an attempt
to schedule the computer.
This numeric value should match <i>EPORT</i> configured
in the <i>moab.cnf</i> file.</p>
<p><b>JobAggregationTime</b> is used to avoid notifying Moab
of large numbers of events occuring about the same time.
If an event occurs within this number of seconds since Moab was
last notified of an event, another notification is not sent.
This should be an integer number of seconds.
The default value is 10 seconds.
The value should match <i>JOBAGGREGATIONTIME</i> configured
in the <i>moab.cnf</i> file.</p>
<p><b>JobPriority</b> controls the scheduling of newly arriving
jobs in SLURM.
jobs in SLURM.
SLURM can either place all newly arriving jobs in a HELD state
SLURM can either place all newly arriving jobs in a HELD state
(priority = 0) and let Moab decide when and where to run the jobs
(priority = 0) and let Moab decide when and where to run the jobs
...
@@ -74,24 +86,31 @@ AuthKey=1234
...
@@ -74,24 +86,31 @@ AuthKey=1234
#
#
# Have Moab control job scheduling
# Have Moab control job scheduling
JobPriority=hold
JobPriority=hold
#
# Moab event notification port, matches EPORT in moab.cfg
EPort=15017
#
# Moab event notifcation throttle, matches JOBAGGREGATIONTIME
# in moab.cfg (integer value in seconds)
JobAggregationTime=15
</pre>
</pre>
</p>
</p>
<h3>Moab Configuration</h3>
<h3>Moab Configuration</h3>
<p>Moab has support for SLURM's WIKI interface by default.
<p>Moab has support for SLURM's WIKI interface by default.
Specify this interface in the <i>moab.cfg</i> file as follows:
Specify this interface in the <i>moab.cfg</i> file as follows:
</p>
<pre>
<pre>
RMCFG[slurm] TYPE=wiki:slurm AUTHTYPE=CHECKSUM
RMCFG[slurm] TYPE=wiki:slurm AUTHTYPE=CHECKSUM
</pre>
</pre>
In <i>moab-private.cfg</i> specify the private key as follows:
<p>
In <i>moab-private.cfg</i> specify the private key as follows:
</p>
<pre>
<pre>
KEY=1234
KEY=1234
</pre>
</pre>
Insure that this file is protected from viewing by users. </p>
<p>
Insure that this file is protected from viewing by users. </p>
<p class="footer"><a href="#top">top</a></p>
<p class="footer"><a href="#top">top</a></p>
<p style="text-align:center;">Last modified 2
4 August
2006</p>
<p style="text-align:center;">Last modified 2
9 September
2006</p>
<!--#include virtual="footer.txt"-->
<!--#include virtual="footer.txt"-->
This diff is collapsed.
Click to expand it.
doc/html/quickstart_admin.shtml
+
6
−
6
View file @
e5928562
...
@@ -553,8 +553,8 @@ adev0: scontrol shutdown
...
@@ -553,8 +553,8 @@ adev0: scontrol shutdown
<h2>Testing</h2>
<h2>Testing</h2>
<p>An extensive test suite is available within the SLURM distribution
<p>An extensive test suite is available within the SLURM distribution
in <i>testsuite/expect</i>.
in <i>testsuite/expect</i>.
There are about
1
50 tests which will execute on the order of
1
000 jobs
There are about
2
50 tests which will execute on the order of
2
000 jobs
and
2
000 job steps.
and
4
000 job steps.
Depending upon your system configuration and performance, this test
Depending upon your system configuration and performance, this test
suite will take roughly 40 minutes to complete.
suite will take roughly 40 minutes to complete.
The file <i>testsuite/expect/globals</i> contains default paths and
The file <i>testsuite/expect/globals</i> contains default paths and
...
@@ -568,19 +568,19 @@ or the full test suite may be executed with the single command
...
@@ -568,19 +568,19 @@ or the full test suite may be executed with the single command
See <i>testsuite/expect/README</i> for more information.</p>
See <i>testsuite/expect/README</i> for more information.</p>
<h2>Upgrades</h2>
<h2>Upgrades</h2>
<p>When upgrading to a new major or minor release of SLURM (e.g.
0.3
.x to
0.4
.x)
<p>When upgrading to a new major or minor release of SLURM (e.g.
1.1
.x to
1.2
.x)
all running and pending jobs will be purged due to changes in state save
all running and pending jobs will be purged due to changes in state save
information. It is possible to develop software to translate state information
information. It is possible to develop software to translate state information
between versions, but we do not normally expect to do so.
between versions, but we do not normally expect to do so.
When upgrading to a new micro release of SLURM (e.g.
0.3
.1 to
0.3
.2) all
When upgrading to a new micro release of SLURM (e.g.
1.2
.1 to
1.2
.2) all
running and pending jobs will be preserved. Just install a new version of
running and pending jobs will be preserved. Just install a new version of
SLURM and restart the daemons.
SLURM and restart the daemons.
An exception to this is that jobs may be lost when installing new pre-release
An exception to this is that jobs may be lost when installing new pre-release
versions (e.g.
0.4
.0-pre1 to
0.4
.0-pre2). We'll try to note these cases
versions (e.g.
1.3
.0-pre1 to
1.3
.0-pre2). We'll try to note these cases
in the NEWS file.
in the NEWS file.
</pre> <p class="footer"><a href="#top">top</a></p>
</pre> <p class="footer"><a href="#top">top</a></p>
<p style="text-align:center;">Last modified
11 April
2006</p>
<p style="text-align:center;">Last modified
29 September
2006</p>
<!--#include virtual="footer.txt"-->
<!--#include virtual="footer.txt"-->
This diff is collapsed.
Click to expand it.
doc/html/slurm.shtml
+
1
−
0
View file @
e5928562
...
@@ -40,6 +40,7 @@ nodes, each having four Itanium2 processors</li>
...
@@ -40,6 +40,7 @@ nodes, each having four Itanium2 processors</li>
<li><a href="http://www.llnl.gov/asci/platforms/purple/">ASC Purple</a> an IBM SP/AIX
<li><a href="http://www.llnl.gov/asci/platforms/purple/">ASC Purple</a> an IBM SP/AIX
cluster with 1500 nodes, each having eight Power5 processors</li>
cluster with 1500 nodes, each having eight Power5 processors</li>
</ul>
</ul>
There are about 150 downloads of SLURM per month from LLNL's FTP server alone.
As of September 2006, SLURM has been downloaded over 3500 times to over 500
As of September 2006, SLURM has been downloaded over 3500 times to over 500
distinct sites in 38 countries.
distinct sites in 38 countries.
SLURM is also distributed and supported by <a href="http://www.hp.com">
SLURM is also distributed and supported by <a href="http://www.hp.com">
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment