Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
S
Slurm
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
tud-zih-energy
Slurm
Commits
9381f867
Commit
9381f867
authored
12 years ago
by
Danny Auble
Browse files
Options
Downloads
Patches
Plain Diff
better documentation about GrpCPUMins and GrpCPURunMins
parent
39512911
No related branches found
No related tags found
No related merge requests found
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
doc/html/accounting.shtml
+11
-7
11 additions, 7 deletions
doc/html/accounting.shtml
doc/man/man1/sacctmgr.1
+41
-39
41 additions, 39 deletions
doc/man/man1/sacctmgr.1
with
52 additions
and
46 deletions
doc/html/accounting.shtml
+
11
−
7
View file @
9381f867
...
@@ -638,17 +638,21 @@ in the <a href="resource_limits.html">Resource Limits</a> document.</p>
...
@@ -638,17 +638,21 @@ in the <a href="resource_limits.html">Resource Limits</a> document.</p>
this means that the parent association is used for fairshare.
this means that the parent association is used for fairshare.
</li>
</li>
<li><b>GrpCPUMins=</b> A hard limit of cpu minutes to be used by jobs
<li><b>GrpCPUMins=</b> The total number of cpu minutes that can
possibly be used by past, present and future jobs
running from this association and its children. If this limit is
running from this association and its children. If this limit is
reached all jobs running in this group will be killed, and no new
reached all jobs running in this group will be killed, and no new
jobs will be allowed to run.
jobs will be allowed to run. This usage is decayed (at a rate of
PriorityDecayHalfLife). It can also be reset (according to
PriorityUsageResetPeriod) in order to allow jobs to run against the
association tree again.
</li>
</li>
<li><b>GrpCPURunMins=</b>
Maximum number of CPU minutes all jobs
<li><b>GrpCPURunMins=</b>
Used to limit the combined total number of CPU
running with this association and its
children can run at the same
minutes used by all jobs
running with this association and its
time
. This takes into consideration time limit of
running jobs. If
children
. This takes into consideration time limit of
the limit is reached no new jobs are started until other jobs finish
running jobs and consumes it, if the limit is reached no new jobs
to allow time to free up.
are started until other jobs finish
to allow time to free up.
</li>
</li>
<li><b>GrpCPUs=</b> The total count of cpus able to be used at any given
<li><b>GrpCPUs=</b> The total count of cpus able to be used at any given
...
...
This diff is collapsed.
Click to expand it.
doc/man/man1/sacctmgr.1
+
41
−
39
View file @
9381f867
...
@@ -253,9 +253,8 @@ NOTE: This value is only meaningful for QOS PreemptMode=CANCEL)
...
@@ -253,9 +253,8 @@ NOTE: This value is only meaningful for QOS PreemptMode=CANCEL)
.TP
.TP
\fIGrpCPUMins\fP=<max cpu minutes>
\fIGrpCPUMins\fP=<max cpu minutes>
Maximum number of CPU minutes running jobs are able to be allocated in
The total number of cpu minutes that can possibly be used by past,
aggregate for this association and all associations which are children
present and future jobs running from this association and its children.
of this association.
To clear a previously set value use the modify command with a new
To clear a previously set value use the modify command with a new
value of \-1.
value of \-1.
.P
.P
...
@@ -272,11 +271,11 @@ they are able to run inside the limit.
...
@@ -272,11 +271,11 @@ they are able to run inside the limit.
.TP
.TP
\fIGrpCPURunMins\fP=<max cpu run minutes>
\fIGrpCPURunMins\fP=<max cpu run minutes>
Maximum
number of CPU minutes
all jobs
Used to limit the combined total
number of CPU minutes
used by all
running with this association and
all
it
'
s child
associations can run
jobs
running with this association and its child
ren. This takes into
at the same time. This takes into
consideration time limit of running
consideration time limit of running
jobs and consumes it, if the limit
jobs. If the limit
is reached no new jobs are started until other
is reached no new jobs are started until other
jobs finish to allow
jobs finish to allow
time to free up.
time to free up.
.TP
.TP
\fIGrpCPUs\fP=<max cpus>
\fIGrpCPUs\fP=<max cpus>
...
@@ -570,17 +569,16 @@ parent association is used for fairshare.
...
@@ -570,17 +569,16 @@ parent association is used for fairshare.
.TP
.TP
\fIGrpCPUMins\fP
\fIGrpCPUMins\fP
Maximum number of CPU minutes running jobs are able to be allocated in
The total number of cpu minutes that can possibly be used by past,
aggregate for this association and all associations which are children
present and future jobs running from this association and its children.
of this association.
.TP
.TP
\fIGrpCPURunMins\fP
\fIGrpCPURunMins\fP
Maximum
number of CPU minutes
all jobs
Used to limit the combined total
number of CPU minutes
used by all
running with this association and
all
it
'
s child
associations can run
jobs
running with this association and its child
ren. This takes into
at the same time. This takes into
consideration time limit of running
consideration time limit of running
jobs and consumes it, if the limit
jobs. If the limit
is reached no new jobs are started until other
is reached no new jobs are started until other
jobs finish to allow
jobs finish to allow
time to free up.
time to free up.
.TP
.TP
\fIGrpCPUs\fP
\fIGrpCPUs\fP
...
@@ -980,15 +978,15 @@ selected for preemption.
...
@@ -980,15 +978,15 @@ selected for preemption.
.TP
.TP
\fIGrpCPUMins\fP
\fIGrpCPUMins\fP
Maximum
number of
CPU
minutes
running jobs are able to be allocated in
The total
number of
cpu
minutes
that can possibly be used by past,
aggregate for
this QOS.
present and future jobs running from
this QOS.
.TP
.TP
\fIGrpCPURunMins\fP
Maximum number of CPU minutes all jobs
\fIGrpCPURunMins\fP
Used to limit the combined total number of CPU
running with this QOS can run at the same time
. This takes into
minutes used by all jobs running with this QOS
. This takes into
consideration time limit of running jobs
. If the limit is reached
consideration time limit of running jobs
and consumes it, if the limit
no new jobs are started until other jobs finish to allow
time to
is reached
no new jobs are started until other jobs finish to allow
free up.
time to
free up.
.TP
.TP
\fIGrpCPUs\fP
\fIGrpCPUs\fP
...
@@ -1104,8 +1102,8 @@ NOTE: This value is only meaningful for QOS PreemptMode=CANCEL.
...
@@ -1104,8 +1102,8 @@ NOTE: This value is only meaningful for QOS PreemptMode=CANCEL.
.TP
.TP
\fIGrpCPUMins\fP
\fIGrpCPUMins\fP
Maximum
number of
CPU
minutes
running jobs are able to be allocated in
The total
number of
cpu
minutes
that can possibly be used by past,
aggregate for
this QOS.
present and future jobs running from
this QOS.
To clear a previously set value use the modify command with a new
To clear a previously set value use the modify command with a new
value of \-1.
value of \-1.
NOTE: This limit only applies when using the Priority Multifactor
NOTE: This limit only applies when using the Priority Multifactor
...
@@ -1462,18 +1460,20 @@ Anything included on this line will be the defaults for all
...
@@ -1462,18 +1460,20 @@ Anything included on this line will be the defaults for all
associations on this cluster. These options are as follows...
associations on this cluster. These options are as follows...
.TP
.TP
\fIGrpCPUMins=\fP
\fIGrpCPUMins=\fP
Maximum number of CPU hours running jobs are able to
The total number of cpu minutes that can possibly be used by past,
be allocated in aggregate for this association and all associations
present and future jobs running from this association and its children.
which are children of this association. (NOTE: This limit is not
.TP
currently enforced in SLURM. You can still set this, but have to wait
\fIGrpCPURunMins=\fP
for future versions of SLURM before it is enforced.)
Used to limit the combined total number of CPU minutes used by all
jobs running with this association and its children. This takes into
consideration time limit of running jobs and consumes it, if the limit
is reached no new jobs are started until other jobs finish to allow
time to free up.
.TP
.TP
\fIGrpCPUs=\fP
\fIGrpCPUs=\fP
Maximum number of CPUs running jobs are able to be
Maximum number of CPUs running jobs are able to be
allocated in aggregate for this association and all associations which
allocated in aggregate for this association and all associations which
are children of this association. (NOTE: This limit is not currently
are children of this association.
enforced in SLURM. You can still set this, but have to wait for future
versions of SLURM before it is enforced.)
.TP
.TP
\fIGrpJobs=\fP
\fIGrpJobs=\fP
Maximum number of running jobs in aggregate for this
Maximum number of running jobs in aggregate for this
...
@@ -1538,16 +1538,18 @@ A brief description of the account.
...
@@ -1538,16 +1538,18 @@ A brief description of the account.
\fIGrpCPUMins=\fP
\fIGrpCPUMins=\fP
Maximum number of CPU hours running jobs are able to
Maximum number of CPU hours running jobs are able to
be allocated in aggregate for this association and all associations
be allocated in aggregate for this association and all associations
which are children of this association. (NOTE: This limit is not
which are children of this association.
currently enforced in SLURM. You can still set this, but have to wait
\fIGrpCPURunMins=\fP
for future versions of SLURM before it is enforced.)
Used to limit the combined total number of CPU minutes used by all
jobs running with this association and its children. This takes into
consideration time limit of running jobs and consumes it, if the limit
is reached no new jobs are started until other jobs finish to allow
time to free up.
.TP
.TP
\fIGrpCPUs=\fP
\fIGrpCPUs=\fP
Maximum number of CPUs running jobs are able to be
Maximum number of CPUs running jobs are able to be
allocated in aggregate for this association and all associations which
allocated in aggregate for this association and all associations which
are children of this association. (NOTE: This limit is not currently
are children of this association.
enforced in SLURM. You can still set this, but have to wait for future
versions of SLURM before it is enforced.)
.TP
.TP
\fIGrpJobs=\fP
\fIGrpJobs=\fP
Maximum number of running jobs in aggregate for this
Maximum number of running jobs in aggregate for this
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment