- Nov 01, 2013
-
-
Morris Jette authored
Add argument to priority plugin's priority_p_reconfig function to note when the association and QOS used_cpu_run_secs field has been reset. Without this flag, we remove time on "scontrol setdebug" or "scontrol setdebugflag" that can result in used_cpu_run_secs going negative or otherwise get bad values. Correction to logic added in commit 6d793189 bug 423
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Fix to work with change logic introduced in Slurm version 2.6.3 scheduling logic which prevented Maui/Moab from starting jobs.
-
- Oct 30, 2013
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- Oct 29, 2013
-
-
David Bigagli authored
-
Bill Brophy authored
-
Morris Jette authored
Add support for -W block=true (wait for job completion) Clear PBS_NODEFILE environment variable Credit to NCSC
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
limits
-
- Oct 28, 2013
-
-
Morris Jette authored
Add support for dependencies of job array elements (e.g. "sbatch --depend=afterok:123_4 ..."). This does not support depenendencies of ALL job array elements, only individual job array elements.
-
Morris Jette authored
-
Danny Auble authored
preforms the allocation, regression in 2.6.3.
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
- Oct 25, 2013
-
-
Morris Jette authored
Previously the SlurmdDebug value in slurm.conf was ignored if the previous value was not 3/init
-
Morris Jette authored
Reorder some logis in the hostlist functions for performance improvement specifically for "if (A & B) ..." move the fastest tests first (test A should take less time than test B).
-
Morris Jette authored
This avoids building hostlist information with NodeHostName and NodeAddr information unless explisitly requested and can improve performance for the default mode of operation by about 65%.
-
Morris Jette authored
Correct sbatch documentation and job_submit/pbs plugin "%j" is job ID, not "%J" (which is job_id.step_id).
-
- Oct 24, 2013
-
-
Morris Jette authored
Specifically setting innodb_buffer_pool_size=64 in my.conf
-
Morris Jette authored
Without this change a job with a reason of WAIT_PART_DOWN, WAIT_PART_INACTIVE, WAIT_PART_NODE_LIMIT, WAIT_PART_TIME_LIMIT, or WAIT_QOS_THRES would not be cleared when that reason no longer applied.
-
Morris Jette authored
-
- Oct 22, 2013
-
-
Morris Jette authored
Add cgroup create retry logic in case one step is starting at the same time as another step is ending and the logic to create and delete cgroups overlaps. bug 447
-
Dave Henseler authored
-
Morris Jette authored
I did the merge improperly
-
-
Morris Jette authored
If a node has GRES and multiple threads per core the select/cons_res plugin can get stuck in an infinite loop. See bug 475 Contributed by: PREVOST Ludovic NEC HPC Europe
-
Morris Jette authored
-
Morris Jette authored
-
Thomas Cadeau authored
If slurmd fails to get IPMI value, then I propose to force to wait 1 second instead of asking BMC again. (Part 3/4 of the patch). If IPMI init fails when slurmd forces to update the value, then we should not update the value. (Part 4/4 of the patch Part 1/4 and 2/4 add a security in IPMI init because the function can be call several time. This force to return SLURM_FAILURE if the first call failed, since the other call will not do anything. bug 469
-
Morris Jette authored
Previously a node failure would always requeue the job
-
- Oct 21, 2013
-
-
Morris Jette authored
Restore default behavior of allocating cores to jobs on a cyclic basis across the sockets unless SelectTypeParameters=CR_CORE_DEFAULT_DIST_BLOCK or user specifies other distribution options. Reverts commit 7fcdc7e5 bug 466
-
- Oct 20, 2013
-
-
jette authored
Change Sockets to SocketsPerBoard and Procs to CPUs
-
jette authored
If the backfill scheduler relinquishes locks and the normal job scheduler starts a job that the backfill scheduler was actively working, the backfill scheduler will try to re-schedule that same job, possibly resulting in an invalid memory reference or other badness.
-