- Nov 29, 2012
-
-
Morris Jette authored
-
Morris Jette authored
-
- Nov 28, 2012
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
you query against that with -N and -E you will get all jobs during that time instead of only the ones running on -N. Signed-off-by: Danny Auble <da@schedmd.com>
-
Morris Jette authored
-
Morris Jette authored
-
- Nov 27, 2012
-
-
Morris Jette authored
-
Morris Jette authored
to a list, which can be arbitrary size
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Nathan Yee authored
-
Nathan Yee authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
was already in error and isn't deallocating and the underlying hardware goes bad, one could get overlapping blocks in error, making the code assert when a new job request comes in.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
overcommit.
-
Danny Auble authored
overcommit.
-
Morris Jette authored
Previously only requeued the job once
-
Morris Jette authored
-
Morris Jette authored
-
- Nov 26, 2012
-
-
Danny Auble authored
-
jette authored
-
jette authored
Without this change, the nodes associated with a reservation would only be updated if the partition's nodes were reset using the "scontrol update partition" command, but not if they were reset using "scontrol reconfigure"
-
Danny Auble authored
written by other cpus sharing the package
-
Danny Auble authored
where needed)
-
jette authored
This reverts most of commit https://github.com/SchedMD/slurm/commit/570941362ffdc57e9e3d4723bc4f728ae04789d8 and adds a call from slurmctld to srun prior to deallocating nodes and notifying slurmd to cancel the tasks
-
jette authored
-
jette authored
Otherwise an aborted slurmstepd can cause the srun process to hang indefinitely; a problem reported in trouble ticket 149.
-
Morris Jette authored
-
Morris Jette authored
If the slurmstepd connects task I/O, but aborts after srun accepts the connection and before slurmstepd writes data, then srun could possibly hang indefinitely. This probably does not explain failures seen at CEA, but can't hurt matters.
-
- Nov 25, 2012