- Jul 29, 2016
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
Also we are making extern lower case in the api to match this and sacct.
-
Danny Auble authored
-
Danny Auble authored
that had a partition in them.
-
- Jul 28, 2016
-
-
Danny Auble authored
-
Morris Jette authored
Document that the SLURM_JOB_ACCOUNT and SLURM_JOB_QOS environment variables are set for the srun command in its man page. bug 2945
-
Morris Jette authored
Partially revert commit 4f8c62c1 salloc does not set SLURM_JOB_ACCOUNT or SLURM_JOB_QOS
-
Morris Jette authored
Document that the SLURM_JOB_ACCOUNT and SLURM_JOB_QOS environment variables are set for the salloc and sbatch commands in their man pages. bug 2945
-
- Jul 27, 2016
-
-
Morris Jette authored
Document that persistent burst buffers can not be created or destroyed using the salloc or srun --bb options. bug 2404
-
Brian Christiansen authored
Missed in b5bba34c
-
Danny Auble authored
on batch script completes.
-
Danny Auble authored
-
Danny Auble authored
code change.
-
- Jul 26, 2016
-
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Morris Jette authored
The problem only exists on a subset of KNL models, those with 68 cores. bug 2941
-
Danny Auble authored
-
Morris Jette authored
NOTE: Runaway jobs are jobs that don't exist in the controller but are still considered running in the datbase Should be NOTE: Runaway jobs are jobs that don't exist in the controller but are still considered running in the database
-
Danny Auble authored
(difference between start of job and when it was eligible).
-
- Jul 25, 2016
-
-
Danny Auble authored
-
David Gloe authored
Bug 2939.
-
Danny Auble authored
-
Morris Jette authored
This reverts commit fb8e3558 and moves the place where the SuspendExcNodes and SuspendExcParts configuration parameters are processed (needs to happen AFTER the partition and node tables in the slurmctld daemon are built. bug 2934
-
- Jul 23, 2016
-
-
Morris Jette authored
-
- Jul 22, 2016
-
-
Dominik Bartkiewicz authored
Inadvertently broken in commit 05eac196. Bug 2912.
-
Danny Auble authored
or failed based on the signal that would always be killing it.
-
Danny Auble authored
end of the job to do it.
-
Danny Auble authored
make them using the master job ID instead of the normal job ID.
-
- Jul 21, 2016
-
-
Morris Jette authored
-
Morris Jette authored
Treat invalid user ID in AllowUserBoot option of knl.conf file as error rather than fatal (log and do not exit).
-
- Jul 20, 2016
-
-
Morris Jette authored
Prevent slurmctld abort if job is killed or requeued while waiting for reboot of its allocated compute nodes. The _wait_boot() would reference job_ptr->node_bitmap, which would be NULL.
-
Boris Karasev authored
Bug 2908
-
Danny Auble authored
-
Tim Wickberg authored
Step hasn't been assigned resources, so the select_jobinfo struct hasn't yet been populated. Calling select_g_step_finish will dereference causing a segfault. Bug 2922.
-
- Jul 19, 2016
-
-
Morris Jette authored
-
Gennaro Oliva authored
-
Morris Jette authored
If the user is now allowed to use the partition, then do not check that user's group access again for 5 seconds. bug 2913
-
Morris Jette authored
Improve partition AllowGroups caching. Update the table of UIDs permitted to use a partition based upon it's AllowGroups configuration parameter as new valid UIDs are found rather than looking up that user's group information for every job they submit, which can involve considerable overhead for some systems. bug 2913
-