- Nov 07, 2014
-
-
David Bigagli authored
always and it scares people.
-
- Nov 06, 2014
-
-
Danny Auble authored
is requested. This is a refactor of commit e5635a76, related to bug 1148, to handle the cases where a job could run but an error was given when selecting the nodes.
-
Danny Auble authored
-
Danny Auble authored
lock was locked outside of the function or not. This also fixes a race condition that occurs when a QOS is added and used right away while the controller is busy with previous requests.
-
Danny Auble authored
PerCPU. Previously it did not take into account whether the user was requesting per-node memory or whether the job was told it needed to use less than the node allowed.
-
- Nov 05, 2014
-
-
Danny Auble authored
-
- Nov 04, 2014
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
This was an unnoticed regression from commit 0da01963. The problem was that job_ptr->job_resrcs was being cleared too early. This patch waits until the job is actually being requeued before clearing it.
-
- Oct 31, 2014
-
-
Danny Auble authored
pack it this way so we will not change it in 15.08
-
Danny Auble authored
-
Danny Auble authored
This isn't that big of an issue for 14.03, but 14.11 added more to this string, which could overflow the buffer since sprintf is used instead of snprintf. Using xstrfmtcat fixes the issue and makes the code easier to read.
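The overflow risk described above comes from formatting into a fixed-size buffer with no length check. A minimal sketch of the safer pattern follows, assuming a hypothetical helper named str_fmt_cat that appends formatted text to a heap-allocated string and grows it as needed. It is a stand-in written for illustration only, not Slurm's xstrfmtcat, and the name and return convention are assumptions.

```c
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper (illustration only, not Slurm's xstrfmtcat):
 * append formatted text to a heap-allocated string, growing it as needed.
 * *str may be NULL on the first call. Returns 0 on success, -1 on error.
 */
static int str_fmt_cat(char **str, const char *fmt, ...)
{
	va_list ap;
	int need;
	size_t old_len;
	char *grown;

	assert(str != NULL && fmt != NULL);

	/* First pass: ask vsnprintf how many bytes the formatted text needs. */
	va_start(ap, fmt);
	need = vsnprintf(NULL, 0, fmt, ap);
	va_end(ap);
	if (need < 0)
		return -1;

	/* Grow the buffer to hold the old text, the new text, and the NUL. */
	old_len = *str ? strlen(*str) : 0;
	grown = realloc(*str, old_len + (size_t)need + 1);
	if (grown == NULL)
		return -1;	/* the original string is left untouched */
	*str = grown;

	/* Second pass: write into the exactly sized tail, so it cannot overflow. */
	va_start(ap, fmt);
	vsnprintf(*str + old_len, (size_t)need + 1, fmt, ap);
	va_end(ap);
	return 0;
}
```

Because the buffer is sized from the formatted length on every call, adding more fields to the string later does not require revisiting a hard-coded buffer size. The caller is responsible for freeing the string with free().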
-
Danny Auble authored
-
Danny Auble authored
number of tasks / number of nodes.
-
- Oct 30, 2014
-
-
David Bigagli authored
-
David Bigagli authored
to dereference it.
-
- Oct 27, 2014
-
-
Danny Auble authored
are specified. This fixes commit b9cc5b31, which did not account for mc_ptr->ntasks_per_core being initialized to INFINITE. Without this fix, the packed node_cnt would be set to 1 in the user tools. This fixes bug 1148.
-
Morris Jette authored
bug 1207
-
- Oct 24, 2014
-
-
Morris Jette authored
Use default_partition by default for the test
-
Morris Jette authored
The test reported a pass when it should have failed, because of a bad variable name.
-
David Singleton authored
We've seen slurmctld crashes due to negative job array indices.
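The commit's actual change is not shown in this listing. As a rough sketch of the kind of input check that guards against negative indices, assuming a hypothetical function parse_array_index and a caller-supplied upper bound (neither is taken from the Slurm source):

```c
#include <errno.h>
#include <stdlib.h>

/*
 * Hypothetical validator (illustration only, not Slurm code):
 * parse a job array index from text. Returns the index on success,
 * or -1 if the text is empty, not a plain decimal number, negative,
 * or larger than max_index.
 */
static long parse_array_index(const char *text, long max_index)
{
	char *end = NULL;
	long value;

	if (text == NULL || *text == '\0')
		return -1;

	errno = 0;
	value = strtol(text, &end, 10);
	if (errno != 0 || end == text || *end != '\0')
		return -1;	/* out of range for long, or trailing garbage */

	/* Reject negative indices before they can be used as array offsets. */
	if (value < 0 || value > max_index)
		return -1;

	return value;
}
```

Rejecting the value at parse time keeps the negative index from ever reaching code that uses it as a bitmap or array offset.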
-
- Oct 23, 2014
-
-
Morris Jette authored
The previous patch should work in most cases, but this should work more reliably and the comment is clearer. bug 1196
-
David Bigagli authored
This reverts commit 7e65f924.
-
David Bigagli authored
-
Morris Jette authored
BGQ: Fix race condition when job fails due to hardware failure and is requeued. Previous code could result in slurmctld abort with NULL pointer. bug 1096
-
- Oct 22, 2014
-
-
Gennaro Oliva authored
-
- Oct 21, 2014
-
-
Morris Jette authored
Fix bug that prevented preservation of a job's GRES bitmap on slurmctld restart or reconfigure (bug was introduced in 14.03.5 "Clear record of a job's gres when requeued" and only applies when GRES mapped to specific files). bug 1192
-
- Oct 20, 2014
-
-
Danny Auble authored
-
David Bigagli authored
permission for the batch step.
-
David Bigagli authored
-
jette authored
Otherwise there will be no log file to write to, resulting in an abort. bug 1185
-
- Oct 18, 2014
-
-
Nicolas Joly authored
-
- Oct 17, 2014
-
-
Morris Jette authored
-
Morris Jette authored
-
David Bigagli authored
-
David Bigagli authored
-
Morris Jette authored
Correct tracking of licenses for suspended jobs on slurmctld reconfigure or restart. Previously licenses for suspended jobs were not counted, so the license count could be exceeded when those jobs were resumed.
-
Danny Auble authored
commit a9dc50d4.
-
- Oct 16, 2014
-
-
Brian Christiansen authored
-