- Sep 05, 2003
-
-
Moe Jette authored
-
Moe Jette authored
occur naturally if a srun, scontrol, scancel, sinfo, or squeue command is killed by the user with a communication to slurmctld in progress. This seems to occur fairly regularly as part of batch job termination.
-
Moe Jette authored
-
Moe Jette authored
send/receive, function (poll, timeout, send, recv, etc), and the error message are all reported.
-
- Sep 04, 2003
- Sep 03, 2003
-
-
Moe Jette authored
It was picking zero nodes and failing.
-
Moe Jette authored
-
Mark Grondona authored
problem when debugging remote tasks. (and error should have only printed once anyway)
-
- Sep 02, 2003
-
-
Mark Grondona authored
not SIGXCPU on reaching timelimit.
-
- Aug 20, 2003
-
-
jwindley authored
-
- Aug 14, 2003
- Aug 13, 2003
-
-
Mark Grondona authored
-
Moe Jette authored
to match that of a job's run time (TIME).
-
Mark Grondona authored
-
Moe Jette authored
hand if race condition starting all daemons).
-
Moe Jette authored
-
Mark Grondona authored
-
Moe Jette authored
-
Mark Grondona authored
o Export SLURM_NODELIST environment var in all cases
-
Mark Grondona authored
o Changed correspondingly calls to above o Removed unnecessary envc member of job structure, along with references to said member
-
Moe Jette authored
-
Moe Jette authored
-
Mark Grondona authored
is through with them. o added new function unsetenvp() to support above
-
Moe Jette authored
the old_job_allocation response so as to conform with job step's specification. For example, remove nodes that are excluded (-x), set the node count to that requested (-N, eliminate nodes as required), etc. This requires modifying the node list including the slurm_addr, and cpu_count arrays.
-
- Aug 12, 2003
- Aug 08, 2003
- Aug 07, 2003
-
-
Moe Jette authored
By default, logging is to syslog. Expanded usage description to be more complete.
-
Moe Jette authored
-
Moe Jette authored
without duplicates. For example, we want to convert a users request for nodes "lx1,lx2,lx1,lx3" to "lx[1-3]".
-
Moe Jette authored
entry counting function. As a job is completing the bitmap gets cleared as nodes complete, resulting in incorrect values.
-
Moe Jette authored
-
- Aug 06, 2003
-
-
Moe Jette authored
-
- Aug 05, 2003