Commit 5531a249 authored by Danny Auble, committed by Alejandro Sanchez

Docs - Correct description of Overcommitted in sreport


Continuation of c91a42c3.

Bug 10240

Co-authored-by: Ben Roberts <ben@schedmd.com>
parent c91a42c3
@@ -189,6 +189,13 @@ UserUtilizationByWckey, Utilization, WCKeyUtilizationByUser
.B user
\- TopUsage
.LP
+\fBNOTE\fR: If \fBOverSubscribe\fR is configured to \fBFORCE\fR or \fBYES\fR
+in your slurm.conf and the system is not configured to use preemption
+(\fBPreemptMode=OFF\fR) accounting can easily grow to values greater than
+the actual utilization. It may be common on such systems to get error messages
+in the slurmdbd log stating: "We have more allocated time than is possible."
.SH "REPORT DESCRIPTION"
.TP
@@ -229,14 +236,16 @@ responding (assuming TrackSlurmctldDown is set in slurmdbd.conf).
.IP Idle
Time where nodes had no active jobs or reservations.
.IP Overcommited
-If there is enough utilization to consume more system time than is
-expected, the extra usage is recorded as Overcommitted.
+Time of eligible jobs waiting in the queue over the Reserved time. Unlike
+Reserved, this has no limit. It is typically useful to determine whether your
+system is overloaded and by how much.
.IP PlannedDown
Time that nodes were in use by a reservation created with the MAINT flag
but not the IGNORE_JOBS flag.
.IP Reserved
Time that a node spent idle with eligible jobs in the queue that were
-unable to start due to time or size constraints. If this value is
+unable to start due to time or size constraints. The maximum time is limited
+to the amount of time possible on the system. If this value is
not of importance for you then the number can be grouped with idle time.
.IP TresCount
Configured number of TRES' on the reported nodes. See also the
@@ -5210,6 +5210,13 @@ will prevent oversubscription from happening due to preemption as well.
\fBFORCE\fR that is greater than 1. For example, \fBOverSubscribe=FORCE:2\fR
will permit two jobs per resource normally, but a third job can be started
only if done so through preemption based upon QOS.
+\fBNOTE\fR: If \fBOverSubscribe\fR is configured to \fBFORCE\fR or \fBYES\fR
+in your slurm.conf and the system is not configured to use preemption
+(\fBPreemptMode=OFF\fR) accounting can easily grow to values greater than
+the actual utilization. It may be common on such systems to get error messages
+in the slurmdbd log stating: "We have more allocated time than is possible."
.TP
\fBYES\fR
Makes all resources in the partition available for sharing upon request by
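
The revised Reserved and Overcommitted descriptions in the sreport hunk can be read as a simple split: waiting time of eligible jobs counts as Reserved until the system's possible time is used up, and anything beyond that counts as Overcommitted. The following is a minimal Python sketch of that reading, not the slurmdbd rollup code. The function name, its parameters, and the assumption that all values are in the same unit (for example CPU-seconds over one reporting period) are hypothetical.

```python
def split_waiting_time(total, allocated, down, planned_down, waiting):
    """Split eligible-job waiting time into Reserved and Overcommitted.

    Hypothetical illustration only: all arguments are amounts of time in
    the same unit for one reporting period. This is not Slurm's code.
    """
    # Time already accounted for by running jobs and unavailable nodes.
    used = allocated + down + planned_down
    # Time still available on the system; never negative.
    free = max(total - used, 0)
    # Reserved is capped at the time that is actually possible.
    reserved = min(waiting, free)
    # Overcommitted has no limit: it takes whatever Reserved could not hold.
    overcommitted = waiting - reserved
    # Whatever is left after Reserved is idle time.
    idle = free - reserved
    return {"reserved": reserved, "overcommitted": overcommitted, "idle": idle}


# A system with 100 units of time, 70 of them in use, and 50 units of
# eligible jobs waiting: only 30 units fit under Reserved.
overloaded = split_waiting_time(100, 60, 10, 0, 50)
# The same system with only 10 units of waiting jobs is not overloaded.
light = split_waiting_time(100, 60, 10, 0, 10)
```

Under these assumptions, a non-zero Overcommitted value means the queue held more eligible work than the system could have run in the period, which matches the stated purpose of judging whether the system is overloaded and by how much.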