tud-zih-energy / Slurm, commit 14b15749, authored 10 years ago by Morris Jette
Major update to power management web page
parent
00f40ddc
doc/html/power_mgmt.shtml: 26 additions, 7 deletions
@@ -21,6 +21,9 @@ Note this functionality is distinct from Slurm's ability to
<p>The following configuration parameters are available:
<ul>
<li><b>DebugFlags=power</b>:
Enable plugin-specific logging messages.</li>
<li><b>PowerParameters</b>:
Defines power management behavior.
Changes to this value take effect when the Slurm daemons are reconfigured.
@@ -56,6 +59,17 @@ Currently valid options are:
maximum power consumption.
The default value is 20 percent.
Supported by the power/cray plugin.</li>
<li><b>job_level</b> -
All compute nodes associated with every job will be assigned the same power
cap.
Nodes shared by multiple jobs will have a power cap different from other
nodes allocated to the individual jobs.
By default, this is configurable by the user for each job.</li>
<li><b>job_no_level</b> -
Power caps are established independently for each compute node.
This disables the "--power=level" option available in the job submission
commands.
By default, this is configurable by the user for each job.</li>
<li><b>lower_threshold=#</b> -
Specify a lower power consumption threshold.
If a node's current power consumption is below this percentage of its current
@@ -101,6 +115,17 @@ or loading the capmc module:</p>
$ module load capmc
</pre>
<h2>User and System Administrator Commands</h2>
<p>Equal power caps for all nodes allocated to a job can be requested at job
submission time by using the "--power=level" option with the salloc, sbatch
or srun command.
The system administrator can override the user option with the PowerParameters
configuration parameter and the job_level or job_no_level option.</p>
<p>The power consumption and power cap data are available for all compute nodes
using either the "scontrol show node" or sview commands.</p>
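The interaction between the user's "--power=level" request and the administrator's job_level / job_no_level options can be sketched in a few lines of Python. This is only an illustration of the behavior described in the text, not Slurm's implementation: the function name `job_uses_level_caps` and its arguments are hypothetical. It assumes PowerParameters is a comma-separated option string, and the `lower_threshold=90` value is just a placeholder. The text does not say what happens if both job_level and job_no_level are set; the sketch arbitrarily checks job_level first.

```python
def job_uses_level_caps(power_parameters: str, user_requested_level: bool) -> bool:
    """Return True if all nodes of a job should share one power cap.

    Hypothetical helper, not part of Slurm. Assumes PowerParameters is a
    comma-separated option string such as "job_level,lower_threshold=90".
    """
    # Keep only the option names, dropping any "=value" suffix.
    options = {
        opt.strip().split("=", 1)[0]
        for opt in power_parameters.split(",")
        if opt.strip()
    }
    if "job_level" in options:
        # Administrator forces equal caps for every job.
        return True
    if "job_no_level" in options:
        # Administrator disables the "--power=level" job option.
        return False
    # Default: the user's per-job choice applies.
    return user_requested_level


if __name__ == "__main__":
    print(job_uses_level_caps("job_level", False))
    print(job_uses_level_caps("job_no_level,lower_threshold=90", True))
    print(job_uses_level_caps("lower_threshold=90", True))
```

With neither option configured, the result simply mirrors whether the job was submitted with "--power=level".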
<h2>Example</h2>
<h3>Initial State</h3>
@@ -162,12 +187,6 @@ to be distributed over 9 nodes or 184 watts per node.</p>
<ul>
<li>We need to determine how the current power consumption data is gathered
and determine accuracy of that data.</li>
<li>The current algorithm manages power on a per-node basis without considering
job allocations. Maintaining uniform power caps across all nodes associated
with each job might improve performance, but complicate the logic. One option
might be to add a job option of this sort "--power=equal" to indicate that
all nodes associated with a job should have the same power cap, even if their
power consumptions differ.</li>
<li>Cray permits independent power capping for accelerators (GPUs or MICs),
which is not currently used by Slurm.</li>
<li>Current default values for configuration parameters should probably be
@@ -180,6 +199,6 @@ to be distributed over 9 nodes or 184 watts per node.</p>
That should be practical to add in the future, but has not yet been studied.</li>
</ul>
<p style="text-align:center;">Last modified 30 January 2015</p>
<p style="text-align:center;">Last modified 12 February 2015</p>
<!--#include virtual="footer.txt"-->