Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
S
Slurm
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
tud-zih-energy
Slurm
Commits
352476f4
Commit
352476f4
authored
13 years ago
by
Morris Jette
Browse files
Options
Downloads
Patches
Plain Diff
Correct faq.html numbering
parent
f67d7daa
No related branches found
No related tags found
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
doc/html/faq.shtml
+7
-7
7 additions, 7 deletions
doc/html/faq.shtml
with
7 additions
and
7 deletions
doc/html/faq.shtml
+
7
−
7
View file @
352476f4
...
@@ -88,7 +88,7 @@ SLURM? Why does the DAKOTA program not run with SLURM?</a></li>
...
@@ -88,7 +88,7 @@ SLURM? Why does the DAKOTA program not run with SLURM?</a></li>
core files?</a></li>
core files?</a></li>
<li><a href="#limit_propagation">Is resource limit propagation
<li><a href="#limit_propagation">Is resource limit propagation
useful on a homogeneous cluster?</a></li>
useful on a homogeneous cluster?</a></li>
<li<a href="#clock">Do I need to maintain synchronized clocks
<li
>
<a href="#clock">Do I need to maintain synchronized clocks
on the cluster?</a></li>
on the cluster?</a></li>
<li><a href="#cred_invalid">Why are "Invalid job credential" errors
<li><a href="#cred_invalid">Why are "Invalid job credential" errors
generated?</a></li>
generated?</a></li>
...
@@ -1396,7 +1396,7 @@ address instead of the correct address and make it so the
...
@@ -1396,7 +1396,7 @@ address instead of the correct address and make it so the
communication doesn't work. Solution is to either remove this line or
communication doesn't work. Solution is to either remove this line or
set a different nodeaddr that is known by your other nodes.</p>
set a different nodeaddr that is known by your other nodes.</p>
<p><a name="stop_sched"><b>3
8
. How can I stop SLURM from scheduling jobs?</b></a></br>
<p><a name="stop_sched"><b>3
9
. How can I stop SLURM from scheduling jobs?</b></a></br>
You can stop SLURM from scheduling jobs on a per partition basis by setting
You can stop SLURM from scheduling jobs on a per partition basis by setting
that partition's state to DOWN. Set its state UP to resume scheduling.
that partition's state to DOWN. Set its state UP to resume scheduling.
For example:
For example:
...
@@ -1405,7 +1405,7 @@ $ scontrol update PartitionName=foo State=DOWN
...
@@ -1405,7 +1405,7 @@ $ scontrol update PartitionName=foo State=DOWN
$ scontrol update PartitionName=bar State=UP
$ scontrol update PartitionName=bar State=UP
</pre></p>
</pre></p>
<p><a name="scontrol_multi_jobs"><b>
39
. Can I update multiple jobs with a
<p><a name="scontrol_multi_jobs"><b>
40
. Can I update multiple jobs with a
single <i>scontrol</i> command?</b></a></br>
single <i>scontrol</i> command?</b></a></br>
No, but you can probably use <i>squeue</i> to build the script taking
No, but you can probably use <i>squeue</i> to build the script taking
advantage of its filtering and formatting options. For example:
advantage of its filtering and formatting options. For example:
...
@@ -1413,7 +1413,7 @@ advantage of its filtering and formatting options. For example:
...
@@ -1413,7 +1413,7 @@ advantage of its filtering and formatting options. For example:
$ squeue -tpd -h -o "scontrol update jobid=%i priority=1000" >my.script
$ squeue -tpd -h -o "scontrol update jobid=%i priority=1000" >my.script
</pre></p>
</pre></p>
<p><a name="amazon_ec2"><b>4
0
. Can SLURM be used to run jobs on
<p><a name="amazon_ec2"><b>4
1
. Can SLURM be used to run jobs on
Amazon's EC2?</b></a></br>
Amazon's EC2?</b></a></br>
<p>Yes, here is a description of use SLURM use with
<p>Yes, here is a description of use SLURM use with
<a href="http://aws.amazon.com/ec2/">Amazon's EC2</a> courtesy of
<a href="http://aws.amazon.com/ec2/">Amazon's EC2</a> courtesy of
...
@@ -1437,7 +1437,7 @@ which I then copy over the /usr/local on the first instance and NFS export to
...
@@ -1437,7 +1437,7 @@ which I then copy over the /usr/local on the first instance and NFS export to
all other instances. This way I have persistent home directories and a very
all other instances. This way I have persistent home directories and a very
simple first-login script that configures the virtual cluster for me.</p>
simple first-login script that configures the virtual cluster for me.</p>
<p><a name="core_dump"><b>4
1
. If a SLURM daemon core dumps, where can I find the
<p><a name="core_dump"><b>4
2
. If a SLURM daemon core dumps, where can I find the
core file?</b></a></br>
core file?</b></a></br>
<p>For <i>slurmctld</i> the core file will be in the same directory as its
<p>For <i>slurmctld</i> the core file will be in the same directory as its
log files (<i>SlurmctldLogFile</i>) iif configured using an fully qualified
log files (<i>SlurmctldLogFile</i>) iif configured using an fully qualified
...
@@ -1453,7 +1453,7 @@ Otherwise it will be found in directory used for saving state
...
@@ -1453,7 +1453,7 @@ Otherwise it will be found in directory used for saving state
occurs. It will either be in spawned job's working directory on the same
occurs. It will either be in spawned job's working directory on the same
location as that described above for the <i>slurmd</i> daemon.</p>
location as that described above for the <i>slurmd</i> daemon.</p>
<p><a name="totalview"><b>4
2
. How can TotalView be configured to operate with
<p><a name="totalview"><b>4
3
. How can TotalView be configured to operate with
SLURM?</b></a></br>
SLURM?</b></a></br>
<p>The following lines should also be added to the global <i>.tvdrc</i> file
<p>The following lines should also be added to the global <i>.tvdrc</i> file
for TotalView to operate with SLURM:
for TotalView to operate with SLURM:
...
@@ -1470,7 +1470,7 @@ dset TV::parallel_configs {
...
@@ -1470,7 +1470,7 @@ dset TV::parallel_configs {
}
}
</pre></p>
</pre></p>
<p><a name="git_patch"><b>4
3
. How can a patch file be generated from a SLURM
<p><a name="git_patch"><b>4
4
. How can a patch file be generated from a SLURM
commit in github?</b></a></br>
commit in github?</b></a></br>
<p>Find and open the commit in github then append ".patch" to the URL and save
<p>Find and open the commit in github then append ".patch" to the URL and save
the resutling file. For an example, see:
the resutling file. For an example, see:
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment