- Jul 18, 2014
-
-
Bill Brophy authored
uses SelectType=select/cons_res otherwise don't send the request to slurmctld and return an error.
-
Morris Jette authored
Correct NumCPUs count for jobs with --exclusive option. bug 909
-
Morris Jette authored
This probably only happens on native Cray systems due to the deallocation delays related to node health check. In any case, the symptom is an error message of this sort: "job # dealloc of node ... bad node_offset 0 count is 0". It then fails to release the node's GRES for use by other jobs. bug 973
-
Danny Auble authored
slurm_conf_reinit.
-
Danny Auble authored
counting as multiple nodes.
-
- Jul 17, 2014
-
-
Morris Jette authored
-
Gennaro Oliva authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
David Bigagli authored
slurmstepd attempts to create it, for example left over from a previous requeue or crash, delete it and recreate it. #961.
-
- Jul 16, 2014
-
-
David Bigagli authored
-
Morris Jette authored
-
Morris Jette authored
switch/nrt - Unload job tables (in addition to windows) in user space mode to avoid leaking NRT. bug 964
-
Morris Jette authored
Test for NULL array pointers before trying to free the array elements. We are not seeing problems with this; the change just hardens the code.
-
- Jul 15, 2014
-
-
Morris Jette authored
Fix race condition that could result in a requeue if batch job exit and node registration occur at the same time.
-
Danny Auble authored
-
Danny Auble authored
(From that commit) There was a problem when building from source where, for example, @bindir@ would resolve to ${prefix}/bin. This patch fixes it, based on http://www.gnu.org/software/autoconf/manual/autoconf-2.69/html_node/Installation-Directory-Variables.html. It also changes opt_modulefiles_slurm to opt_modulefiles_slurm.in, but I couldn't figure out how to get git diff to show that.
-
- Jul 14, 2014
-
-
David Bigagli authored
and should there be problems accessing the state files.
-
Morris Jette authored
Note that the map/mask specified applies to all allocated nodes.
-
Morris Jette authored
Fix for possible abort on change in GRES configuration. bug 958
-
Morris Jette authored
-
- Jul 11, 2014
-
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
Remi Palancher authored
-
Remi Palancher authored
(commit 4cd63575) with sacctmgr load when Parent has "'" around it
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
- Jul 10, 2014
-
-
Danny Auble authored
-
Danny Auble authored
-
Nathan Yee authored
page.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
special characters in their names like ':'
-
Morris Jette authored
It seems to have been broken for some time because the logic blocking SIGWINCH was removed from srun, so the signal could be delivered to any thread rather than to the thread designed to handle window resizing.
-
Morris Jette authored
slurmstepd reported an error when trying to read the window size from srun while srun was exiting. This eliminates that spurious error message.
-