- Oct 01, 2015
-
-
Morris Jette authored
This required a fairly major re-write of the select plugin logic bug 1975
-
- Sep 30, 2015
-
-
Morris Jette authored
Correct some cgroup paths ("step_batch" vs. "step_4294967294", "step_exter" vs. "step_extern", and "step_extern" vs. "step_4294967295").
-
Morris Jette authored
Document that if a job's memory per CPU limit exceeds the system limit, that the job's memory limit is decreased and it's CPU count increased automatically.
-
Brian Christiansen authored
Conflicts: NEWS src/slurmctld/job_mgr.c src/srun/libsrun/launch.c
-
Brian Christiansen authored
Continuation of 1252d1a1 Bug 1938
-
Morris Jette authored
Requeue/hold batch job launch request if job already running. This is possible if node went to DOWN state, but jobs remained active. In addition, if a prolog/epilog failed DRAIN the node rather than setting it down, which could kill jobs that could continue to run. bug 1985
-
- Sep 29, 2015
-
-
Morris Jette authored
This makes srun more consistent with salloc and sbatch
-
Morris Jette authored
Previous logic would not report termiation siganl, only exit code, which could be meaningless.
-
Brian Christiansen authored
Bug 1938
-
Brian Christiansen authored
Bug 1984
-
- Sep 28, 2015
-
-
Morris Jette authored
When nodes have been allocated to a job and then released by the job while resizing, this patch prevents the nodes from continuing to appear allocated and unavailable to other jobs. Requires exclusive node allocation to trigger. This prevents the previously reported failure, but a proper fix will be quite complex and delayed to the next major release of Slurm (v 16.05). bug 1851
-
Morris Jette authored
When nodes have been allocated to a job and then released by the job while resizing, this patch prevents the nodes from continuing to appear allocated and unavailable to other jobs. Requires exclusive node allocation to trigger. This prevents the previously reported failure, but a proper fix will be quite complex and delayed to the next major release of Slurm (v 16.05). bug 1851
-
Gennaro Oliva authored
-
Morris Jette authored
Optimizing topology takes place first, then picking lowest weight nodes within the switches offering the best fit. bug 1979
-
- Sep 25, 2015
-
-
Koji Tanaka authored
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Add ability to change a job array's maximum running task count: "scontrol update jobid=# arraytaskthrottle=#" bug 1863
-
Morris Jette authored
-
Morris Jette authored
Added as part of requeue/hold update
-
Morris Jette authored
-
- Sep 24, 2015
-
-
Morris Jette authored
Was printing "Name=#" rather than "JobID=#"
-
Danny Auble authored
-
Nathan Yee authored
Validate that sbatch, srun, salloc return partition error message on invalid partition name. bug 1223
-
Danny Auble authored
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
option.
-
Danny Auble authored
we are root or slurmuser. This was hiding a bug that will be fixed in the next commit.
-
Gennaro Oliva authored
-
Morris Jette authored
Previous logic would stop at "/".
-
Nathan Yee authored
bug 1228
-
Morris Jette authored
Modify scontrol requeue and requeue_hold commands to accept comma delimited list of job IDs. bug 1929
-
Morris Jette authored
Previously, scontrol would generate an error if passed a comma delimited list of job IDs. A space delimited list would be accepted. This increases compatability with some other Slurm commands. bug 1929
-
- Sep 23, 2015
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-