Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
S
Slurm
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
tud-zih-energy
Slurm
Commits
927f634d
Commit
927f634d
authored
14 years ago
by
Moe Jette
Browse files
Options
Downloads
Patches
Plain Diff
update to cray admin web page
parent
e1df3d89
No related branches found
No related tags found
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
doc/html/cray.shtml
+35
-5
35 additions, 5 deletions
doc/html/cray.shtml
with
35 additions
and
5 deletions
doc/html/cray.shtml
+
35
−
5
View file @
927f634d
...
@@ -91,7 +91,7 @@ http://code.google.com/p/munge/downloads/list</a></li>
...
@@ -91,7 +91,7 @@ http://code.google.com/p/munge/downloads/list</a></li>
which has already the correct paths - in particular, the libmunge.* need to
which has already the correct paths - in particular, the libmunge.* need to
reside in <i>/usr/lib64</i> since SLURM plugins call them.</p>
reside in <i>/usr/lib64</i> since SLURM plugins call them.</p>
<p>Install tar-ball by
<p>Install
the
tar-ball by
executing:
<ul>
<ul>
<li>scp $LIBROOT/munge_build-.*YYYY-MM-DD.tar.gz boot:</li>
<li>scp $LIBROOT/munge_build-.*YYYY-MM-DD.tar.gz boot:</li>
<li>ssh boot</li>
<li>ssh boot</li>
...
@@ -111,13 +111,16 @@ reside in <i>/usr/lib64</i> since SLURM plugins call them.</p>
...
@@ -111,13 +111,16 @@ reside in <i>/usr/lib64</i> since SLURM plugins call them.</p>
<i>mkdir --mode=0711 -vp /var/lib/munge</i><br>
<i>mkdir --mode=0711 -vp /var/lib/munge</i><br>
<i>mkdir --mode=0700 -vp /var/log/munge</i><br>
<i>mkdir --mode=0700 -vp /var/log/munge</i><br>
<i>mkdir --mode=0755 -vp /var/run/munge</i></li>
<i>mkdir --mode=0755 -vp /var/run/munge</i></li>
<li><i>module load slurm</i> as root, to get the
right slurm
paths</li>
<li><i>module load slurm</i> as root, to get the
correct SLURM
paths</li>
<li><i>munged --key-file /opt/slurm/munge/etc/munge.key</i></li>
<li><i>munged --key-file /opt/slurm/munge/etc/munge.key</i></li>
<li>Try a "<i>munge -n</i>" to see if munged accepts input</li>
<li>Try a "<i>munge -n</i>" to see if munged accepts input</li>
</ol></p>
</ol></p>
<p>When done, verify network connectivity by executing<br>
<p>When done, verify network connectivity by executing:
$ <i>munge -n |ssh other-login-host /opt/slurm/munge/bin/unmunge</i></p>
<ul>
<li>munge -n |ssh other-login-host /opt/slurm/munge/bin/unmunge</li>
</ul>
<p>If you decide to keep the installation, you may be interested in automating
<p>If you decide to keep the installation, you may be interested in automating
the process using an <i>init.d</i> script distributed with the Munge as
the process using an <i>init.d</i> script distributed with the Munge as
...
@@ -221,8 +224,35 @@ SDBuser=alps_user
...
@@ -221,8 +224,35 @@ SDBuser=alps_user
SDBdb=XT5istanbul
SDBdb=XT5istanbul
</pre>
</pre>
<p>One additional configuration script can be used to insure that the slurmd
daemons execute with the highest resource limits possible, overriding default
limits on Suse systems. Depending upon what resource limits are propagated
from the user's environment, lower limits may apply to user jobs, but this
script will insure that higher limits are possible. Copy the file
<i>contribs/cray/etc_sysconfig_slurm</i> into <i>/etc/sysconfig/slurm</i>
for these limits to take effect. This script is executed from
<i>/etc/init.d/slurm</i>, which is typically executed to start the SLURM
daemons. An excerpt of <i>contribs/cray/etc_sysconfig_slurm</i>is shown
below.</p>
<pre>
#
# /etc/sysconfig/slurm for Cray XT/XE systems
#
# Cray is SuSe-based, which means that ulimits from
# /etc/security/limits.conf will get picked up any time SLURM is
# restarted e.g. via pdsh/ssh. Since SLURM respects configured limits,
# this can mean that for instance batch jobs get killed as a result
# of configuring CPU time limits. Set sane start limits here.
#
# Values were taken from pam-1.1.2 Debian package
ulimit -t unlimited # max amount of CPU time in seconds
ulimit -d unlimited # max size of a process's data segment in KB
</pre>
<p class="footer"><a href="#top">top</a></p>
<p class="footer"><a href="#top">top</a></p>
<p style="text-align:center;">Last modified 2
0 February
2011</p></td>
<p style="text-align:center;">Last modified 2
8 March
2011</p></td>
<!--#include virtual="footer.txt"-->
<!--#include virtual="footer.txt"-->
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment