Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
S
Slurm
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
tud-zih-energy
Slurm
Commits
9770d90b
Commit
9770d90b
authored
18 years ago
by
Moe Jette
Browse files
Options
Downloads
Patches
Plain Diff
Minor changes to formatting and some verbage.
parent
11e67844
No related branches found
Branches containing commit
No related tags found
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
doc/man/man1/scontrol.1
+71
-34
71 additions, 34 deletions
doc/man/man1/scontrol.1
with
71 additions
and
34 deletions
doc/man/man1/scontrol.1
+
71
−
34
View file @
9770d90b
...
@@ -50,59 +50,81 @@ record counts, etc.
...
@@ -50,59 +50,81 @@ record counts, etc.
Print version information and exit.
Print version information and exit.
.TP
.TP
\fBCOMMANDS\fR
\fBCOMMANDS\fR
.TP
.TP
\f
I
all\fP
\f
B
all\fP
Show all partitiion, their jobs and jobs steps. This causes information to be
Show all partitiion, their jobs and jobs steps. This causes information to be
displayed about partitions that are configured as hidden and partitions that
displayed about partitions that are configured as hidden and partitions that
are unavailable to user's group.
are unavailable to user's group.
.TP
.TP
\f
I
abort\fP
\f
B
abort\fP
Instruct the Slurm controller to terminate immediately and generate a core file.
Instruct the Slurm controller to terminate immediately and generate a core file.
.TP
.TP
\f
I
checkpoint\fP \fICKPT_OP\fP \fIID\fP
\f
B
checkpoint\fP \fICKPT_OP\fP \fIID\fP
Perform a checkpoint activity on the job step(s) with the specified identification.
Perform a checkpoint activity on the job step(s) with the specified identification.
\fICKPT_OP\fP may be
\fIID\fP can be used to identify a specific job (e.g. "<job_id>",
\fIdisable\fP (disable future checkpoints),
which applies to all of its existing steps)
\fIenable\fP (enable future checkpoints),,
\fIable\fP (test if presently not disabled, report start time if checkpoint in progress),
\fIcreate\fP (create a checkpoint and continue the job step),
\fIvacate\fP (create a checkpoint and terminate the job step),
\fIerror\fP (report the result for the last checkpoint request, error code and message), or
\fIrestart\fP (restart execution of the previously checkpointed job steps).
\fIID\fP can be used to identify a specific job (e.g. "<job_id>",
which applies to all of its existing steps)
or a specific job step (e.g. "<job_id>.<step_id>").
or a specific job step (e.g. "<job_id>.<step_id>").
Acceptable values for \fICKPT_OP\fP include:
.RS
.TP
\fIdisable\fP (disable future checkpoints)
.TP
\fIenable\fP (enable future checkpoints)
.TP
\fIable\fP (test if presently not disabled, report start time if checkpoint in progress)
.TP
.TP
\fIcompleting\fP
\fIcreate\fP (create a checkpoint and continue the job step)
.TP
\fIvacate\fP (create a checkpoint and terminate the job step)
.TP
\fIerror\fP (report the result for the last checkpoint request, error code and message)
.TP
\fIrestart\fP (restart execution of the previously checkpointed job steps)
.RE
.TP
\fBcompleting\fP
Display all jobs in a COMPLETING state along with associated nodes in either a
Display all jobs in a COMPLETING state along with associated nodes in either a
COMPLETING or DOWN state.
COMPLETING or DOWN state.
.TP
.TP
\f
I
delete\fP \fISPECIFICATION\fP
\f
B
delete\fP \fISPECIFICATION\fP
Delete the entry with the specified \fISPECIFICATION\fP.
Delete the entry with the specified \fISPECIFICATION\fP.
The only supported \fISPECIFICATION\fP presently is of the form
The only supported \fISPECIFICATION\fP presently is of the form
\fIPartitionName=<name>\fP.
\fIPartitionName=<name>\fP.
.TP
.TP
\f
I
exit\fP
\f
B
exit\fP
Terminate the execution of scontrol.
Terminate the execution of scontrol.
This is an independent command with no options meant for use in interactive mode.
.TP
.TP
\f
I
help\fP
\f
B
help\fP
Display a description of scontrol options and commands.
Display a description of scontrol options and commands.
.TP
.TP
\f
I
hide\fP
\f
B
hide\fP
Do not display partitiion, job or jobs step information for partitions that are
Do not display partitiion, job or jobs step information for partitions that are
configured as hidden or partitions that are unavailable to the user's group.
configured as hidden or partitions that are unavailable to the user's group.
This is the default behavior.
This is the default behavior.
.TP
.TP
\f
I
oneliner\fP
\f
B
oneliner\fP
Print information one line per record.
Print information one line per record.
This is an independent command with no options meant for use in interactive mode.
.TP
.TP
\f
I
pidinfo\fP \fIPROC_ID\fP
\f
B
pidinfo\fP \fIPROC_ID\fP
Print the Slurm job id and scheduled termination time corresponding to the
Print the Slurm job id and scheduled termination time corresponding to the
supplied process id, \fIPROC_ID\fP, on the current node. This will work only
supplied process id, \fIPROC_ID\fP, on the current node. This will work only
with processes on node on which scontrol is run, and only for those processes
with processes on node on which scontrol is run, and only for those processes
spawned by SLURM and their descendants.
spawned by SLURM and their descendants.
.TP
.TP
\f
I
listpids\fP [JOBID[.STEPID]] [NodeName]
\f
B
listpids\fP [JOBID[.STEPID]] [NodeName]
Print a listing of the process IDs in a job step (if JOBID.STEPID is provided),
Print a listing of the process IDs in a job step (if JOBID.STEPID is provided),
or all of the job steps in a job (if JOBID is provided), or all of the job
or all of the job steps in a job (if JOBID is provided), or all of the job
steps in all of the jobs on the local node (if JOBID is not provided or JOBID
steps in all of the jobs on the local node (if JOBID is not provided or JOBID
...
@@ -114,19 +136,24 @@ are unable to identify all processes associated with a job or job step.
...
@@ -114,19 +136,24 @@ are unable to identify all processes associated with a job or job step.
Note that the NodeName option is only really useful when you have multiple
Note that the NodeName option is only really useful when you have multiple
slurmd daemons running on the same host machine. Multiple slurmd daemons on
slurmd daemons running on the same host machine. Multiple slurmd daemons on
one host are, in general, only used by SLURM developers.
one host are, in general, only used by SLURM developers.
.TP
.TP
\f
I
ping\fP
\f
B
ping\fP
Ping the primary and secondary slurmctld daemon and report if
Ping the primary and secondary slurmctld daemon and report if
they are responding.
they are responding.
.TP
.TP
\f
I
quiet\fP
\f
B
quiet\fP
Print no warning or informational messages, only fatal error messages.
Print no warning or informational messages, only fatal error messages.
This is an independent command with no options meant for use in interactive mode.
This is an independent command with no options meant for use in interactive mode.
.TP
.TP
\f
I
quit\fP
\f
B
quit\fP
Terminate the execution of scontrol.
Terminate the execution of scontrol.
This is an independent command with no options meant for use in interactive mode.
.TP
.TP
\f
I
reconfigure\fP
\f
B
reconfigure\fP
Instruct all Slurm daemons to re\-read the configuration file.
Instruct all Slurm daemons to re\-read the configuration file.
This command does not restart the daemons.
This command does not restart the daemons.
This mechanism would be used to modify configuration parameters (Epilog,
This mechanism would be used to modify configuration parameters (Epilog,
...
@@ -140,14 +167,17 @@ however, SLURM daemons should be shutdown and restarted if any of these
...
@@ -140,14 +167,17 @@ however, SLURM daemons should be shutdown and restarted if any of these
parameters are to be changed: AuthType, BackupAddr, BackupController,
parameters are to be changed: AuthType, BackupAddr, BackupController,
ControlAddr, ControlMach, PluginDir, StateSaveLocation, SlurmctldPort
ControlAddr, ControlMach, PluginDir, StateSaveLocation, SlurmctldPort
or SlurmdPort.
or SlurmdPort.
.TP
.TP
\f
I
resume\fP \fIjob_id\fP
\f
B
resume\fP \fIjob_id\fP
Resume a previously suspended job.
Resume a previously suspended job.
.TP
.TP
\f
I
requeue\fP \fIjob_id\fP
\f
B
requeue\fP \fIjob_id\fP
Requeue a running or pending SLURM batch job.
Requeue a running or pending SLURM batch job.
.TP
.TP
\f
I
show\fP \fIENTITY\fP \fIID\fP
\f
B
show\fP \fIENTITY\fP \fIID\fP
Display the state of the specified entity with the specified identification.
Display the state of the specified entity with the specified identification.
\fIENTITY\fP may be \fIconfig\fP, \fIdaemons\fP, \fIjob\fP, \fInode\fP,
\fIENTITY\fP may be \fIconfig\fP, \fIdaemons\fP, \fIjob\fP, \fInode\fP,
\fIpartition\fP or \fIstep\fP.
\fIpartition\fP or \fIstep\fP.
...
@@ -159,20 +189,23 @@ Multiple node names may be specified using simple node range expressions
...
@@ -159,20 +189,23 @@ Multiple node names may be specified using simple node range expressions
(e.g. "lx[10\-20]"). All other \fIID\fP values must identify a single
(e.g. "lx[10\-20]"). All other \fIID\fP values must identify a single
element. The job step ID is of the form "job_id.step_id", (e.g. "1234.1").
element. The job step ID is of the form "job_id.step_id", (e.g. "1234.1").
By default, all elements of the entity type specified are printed.
By default, all elements of the entity type specified are printed.
.TP
.TP
\f
I
shutdown\fP
\f
B
shutdown\fP
Instruct all Slurm daemons to save current state and terminate.
Instruct all Slurm daemons to save current state and terminate.
The Slurm controller (slurmctld) forwards the request all other daemons
The Slurm controller (slurmctld) forwards the request all other daemons
(slurmd daemon on each compute node).
(slurmd daemon on each compute node).
.TP
.TP
\f
I
suspend\fP \fIjob_id\fP
\f
B
suspend\fP \fIjob_id\fP
Suspend a running job.
Suspend a running job.
Use the \fIresume\fP command to resume its execution.
Use the \fIresume\fP command to resume its execution.
User processes must stop on receipt of SIGSTOP signal and resume
User processes must stop on receipt of SIGSTOP signal and resume
upon receipt of SIGCONT for this operation to be effective.
upon receipt of SIGCONT for this operation to be effective.
Not all architectures and configurations support job suspension.
Not all architectures and configurations support job suspension.
.TP
.TP
\f
I
update\fP \fISPECIFICATION\fP
\f
B
update\fP \fISPECIFICATION\fP
Update job, node or partition configuration per the supplied specification.
Update job, node or partition configuration per the supplied specification.
\fISPECIFICATION\fP is in the same format as the Slurm configuration file
\fISPECIFICATION\fP is in the same format as the Slurm configuration file
and the output of the \fIshow\fP command described above. It may be desirable
and the output of the \fIshow\fP command described above. It may be desirable
...
@@ -184,17 +217,21 @@ particular, the hardware configuration of a node or the physical addition or
...
@@ -184,17 +217,21 @@ particular, the hardware configuration of a node or the physical addition or
removal of nodes from the cluster may only be accomplished through editing
removal of nodes from the cluster may only be accomplished through editing
the Slurm configuration file and executing the \fIreconfigure\fP command
the Slurm configuration file and executing the \fIreconfigure\fP command
(described above).
(described above).
.TP
.TP
\f
I
verbose\fP
\f
B
verbose\fP
Print detailed event logging.
Print detailed event logging.
This includes time\-stamps on data structures, record counts, etc.
This includes time\-stamps on data structures, record counts, etc.
This is an independent command with no options meant for use in interactive mode.
This is an independent command with no options meant for use in interactive mode.
.TP
.TP
\f
I
version\fP
\f
B
version\fP
Display the version number of scontrol being executed.
Display the version number of scontrol being executed.
.TP
.TP
\f
I
!!\fP
\f
B
!!\fP
Repeat the last command executed.
Repeat the last command executed.
.TP
.TP
\fBSPECIFICATIONS FOR SHOW AND UPDATE COMMANDS, JOBS\fR
\fBSPECIFICATIONS FOR SHOW AND UPDATE COMMANDS, JOBS\fR
.TP
.TP
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment