Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
S
Slurm
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
tud-zih-energy
Slurm
Commits
1ef7c0c6
Commit
1ef7c0c6
authored
10 years ago
by
Brian Christiansen
Browse files
Options
Downloads
Patches
Plain Diff
Update scheduling configuration documenation.
parent
df531529
No related branches found
No related tags found
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
doc/html/sched_config.shtml
+44
-10
44 additions, 10 deletions
doc/html/sched_config.shtml
with
44 additions
and
10 deletions
doc/html/sched_config.shtml
+
44
−
10
View file @
1ef7c0c6
...
@@ -5,9 +5,18 @@
...
@@ -5,9 +5,18 @@
<H2>Overview</H2>
<H2>Overview</H2>
<P>Slurm is designed to perform a quick and simple scheduling attempt at
<P>Slurm is designed to perform a quick and simple scheduling attempt at
frequent intervals (e.g. at the time of each job submission or completion).
events such as job submission or completion and configuration changes.
A more comprehensive scheduling attempt is made less frequently, typically
During these event-triggered scheduling events, <b>default_queue_depth</b>
by the backfill scheduling plugin.</P>
(default is 100) number of jobs will be considered.</P>
<P>At less frequent intervals, defined by <b>sched_interval</b>, all jobs will
be considered for scheduling.</P>
<P>In either case, once any job or job array task in a partition is left
pending, no other jobs in that partition will be scheduled.</P>
<P>A more comprehensive scheduling attempt is typically done by the backfill
scheduling plugin.</P>
<H2>Scheduling Configuration</H2>
<H2>Scheduling Configuration</H2>
...
@@ -19,7 +28,10 @@ each partition/queue.</P>
...
@@ -19,7 +28,10 @@ each partition/queue.</P>
<P>There is also a <B>SchedulerParameters</B> configuration parameter which
<P>There is also a <B>SchedulerParameters</B> configuration parameter which
can specify a wide range of parameters as described below.
can specify a wide range of parameters as described below.
This first set of parameters applies to all scheduling configurations.</P>
This first set of parameters applies to all scheduling configurations.
See the <a href="slurm.conf.html">slurm.conf(5)</a> man page for more details.
</P>
<UL>
<UL>
<LI><B>default_queue_depth=#</B> - Specifies the number of jobs to consider for
<LI><B>default_queue_depth=#</B> - Specifies the number of jobs to consider for
scheduling on each event that may result in a job being scheduled.
scheduling on each event that may result in a job being scheduled.
...
@@ -29,11 +41,16 @@ small number is generally best.</LI>
...
@@ -29,11 +41,16 @@ small number is generally best.</LI>
Can be useful for high-throughput computing.</LI>
Can be useful for high-throughput computing.</LI>
<LI><B>max_switch_wait=#</B> - Specifies the maximum time a job can wait for
<LI><B>max_switch_wait=#</B> - Specifies the maximum time a job can wait for
desired number of leaf switches. Default value is 300 seconds.</LI>
desired number of leaf switches. Default value is 300 seconds.</LI>
<LI><B>partition_job_depth=#</B> - Specifies how many jobs are tested in any
single partition, default value is 0 (no limit).</LI>
<LI><B>sched_interval=#</B> - Specifies how frequently, in seconds, the main
scheduling loop will execute and test all pending jobs. The default value is
60 seconds.</LI>
</UL>
</UL>
<H2>Backfill Scheduling</H2>
<H2>Backfill Scheduling</H2>
<P>
B
ackfill scheduling plugin is loaded by default.
<P>
The b
ackfill scheduling plugin is loaded by default.
Without backfill scheduling, each partition is scheduled strictly in priority
Without backfill scheduling, each partition is scheduled strictly in priority
order, which typically results in significantly lower system utilization and
order, which typically results in significantly lower system utilization and
responsiveness than otherwise possible.
responsiveness than otherwise possible.
...
@@ -53,7 +70,10 @@ memory requirements, etc.
...
@@ -53,7 +70,10 @@ memory requirements, etc.
If the job under consideration can start immediately without impacting the
If the job under consideration can start immediately without impacting the
expected start time of any higher priority job, then it does so.
expected start time of any higher priority job, then it does so.
Otherwise the resources required by the job will be reserved during the job's
Otherwise the resources required by the job will be reserved during the job's
expected execution time.</P>
expected execution time.
The backfill plugin will set the expected start time for pending jobs. A job's
expected start time can be seen using the <b>squeue --start</b> command.
</P>
<P>Backfill scheduling is difficult without reasonable time limit estimates
<P>Backfill scheduling is difficult without reasonable time limit estimates
for jobs, but some configuration parameters that can help.</P>
for jobs, but some configuration parameters that can help.</P>
...
@@ -72,7 +92,10 @@ and ignore newly submitted jobs (<B>SchedulerParameters=bf_continue</B>).
...
@@ -72,7 +92,10 @@ and ignore newly submitted jobs (<B>SchedulerParameters=bf_continue</B>).
Doing so will permit consideration of more jobs, but may result in the delayed
Doing so will permit consideration of more jobs, but may result in the delayed
scheduling of newly submitted jobs.
scheduling of newly submitted jobs.
A list of <B>SchedulerParameters</B> configuration parameters related to
A list of <B>SchedulerParameters</B> configuration parameters related to
backfill scheduling follows.</P>
backfill scheduling follows.
See the <a href="slurm.conf.html">slurm.conf(5)</a> man page for more details.
</P>
<UL>
<UL>
<LI><B>bf_continue</B> - If set, then continue backfill scheduling after
<LI><B>bf_continue</B> - If set, then continue backfill scheduling after
periodically releasing locks for other operations.</LI>
periodically releasing locks for other operations.</LI>
...
@@ -91,10 +114,21 @@ Default value is 60 seconds.
...
@@ -91,10 +114,21 @@ Default value is 60 seconds.
Larger values are appropriate if job time limits are imprecise and/or
Larger values are appropriate if job time limits are imprecise and/or
small delays in starting pending jobs in order to achieve higher system
small delays in starting pending jobs in order to achieve higher system
utilization is desired.</LI>
utilization is desired.</LI>
<LI><B>bf_window=#</B> - How long into the future to look when determining
<LI><B>bf_window=#</B> - How longin, in minutes, into the future to look when
when and where jobs can start. Default value is one day.</LI>
determining when and where jobs can start. Default value is one day.</LI>
<LI><B>bf_yield_interval=#</B> -
The backfill scheduler will periodically relinquish locks in order for other
pending operations to take place. This specifies the times when the locks are
relinquish in microseconds. The default value is 2,000,000 microseconds
(2 seconds). Smaller values may be helpful for high throughput computing when
used in conjunction with the bf_continue option.</LI>
<LI><B>bf_yield_sleep=#</B> -
The backfill scheduler will periodically relinquish locks in order for other
pending operations to take place. This specifies the length of time for which
the locks are relinquish in microseconds. The default value is 500,000
microseconds (0.5 seconds). </LI>
</UL>
</UL>
<p style="text-align:center;">Last modified
27 March
201
4
</p>
<p style="text-align:center;">Last modified
3 February
201
5
</p>
<!--#include virtual="footer.txt"-->
<!--#include virtual="footer.txt"-->
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment