- Nov 27, 2013
-
-
Morris Jette authored
Original code worked only for Cray systems. For other systems, it set gres_alloc to the total number of each GRES allocated on each node to any job.
-
Morris Jette authored
-
Morris Jette authored
-
Jason Bacon authored
-
Morris Jette authored
-
- Nov 26, 2013
-
-
Chris Scheller authored
-
Morris Jette authored
-
Morris Jette authored
Logs errors related to apbasil use
-
Morris Jette authored
No change in logic; this just moves the code that resets a batch job's accounting information into its own function.
-
Morris Jette authored
-
- Nov 25, 2013
-
-
Danny Auble authored
-
Danny Auble authored
-
jette authored
No change in underlying logic
-
jette authored
This fixes a problem where a job contains a license that is removed in a slurmctld reconfiguration. Without this change, the job would be left with a non-zero license_list pointer referencing memory that had been freed. Bug 527.
-
jette authored
Increase the range of possible reservation time values to allow for a really long RPC delay (possibly due to slurmctld failover from the primary to the backup controller). Also change to a #define value for clarity. Bug 527.
-
- Nov 24, 2013
- Nov 18, 2013
-
-
Morris Jette authored
The time/resource allocation matrix is rebuilt on each job exit, which severely impacts performance at large counts of running jobs (say >10k jobs).
-
- Nov 14, 2013
-
-
Morris Jette authored
bug 511
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- Nov 13, 2013
-
-
Morris Jette authored
-
Morris Jette authored
This makes it simpler to enable detailed debugging for reservations. It includes more information than we probably want to see with DebugFlag=reservation and is intended only for developer debugging.
-
Morris Jette authored
This might have worked fine for core reservations or when there were sufficient idle nodes to use, but the select_g_resv_test() function clears the node bitmap for nodes that it cannot use, and the reservation create logic did not restore that bitmap after a failed resource selection attempt. This logic restores the node bitmap on a failed call to select_g_resv_test() so we can add nodes to the bitmap of available nodes rather than having it repeatedly cleared. The logic also adds some performance enhancements that I will add to in the next commit.
-
Morris Jette authored
-
- Nov 12, 2013
-
-
Danny Auble authored
on a task level if any task hit it the check will be triggered)
-
Danny Auble authored
-
Danny Auble authored
Use mem and memsw failcnt; check for existence. Thanks, Ryan. I'll let you know how it goes.
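The check described above can be sketched roughly as follows. This is a minimal illustration in Python, not the task/cgroup plugin's actual C code. It assumes the cgroup v1 memory controller files `memory.failcnt` and `memory.memsw.failcnt`; the helper names `read_failcnt` and `oom_suspected` and the `cgroup_dir` parameter are hypothetical. The existence check matters because `memory.memsw.failcnt` is only present when swap accounting is enabled.

```python
import os


def read_failcnt(cgroup_dir, name):
    """Return the counter stored in cgroup_dir/name, or None if the file is absent."""
    path = os.path.join(cgroup_dir, name)
    if not os.path.exists(path):
        return None
    with open(path) as handle:
        return int(handle.read().strip())


def oom_suspected(cgroup_dir):
    """Return True if either memory limit counter reports at least one failure."""
    for name in ("memory.failcnt", "memory.memsw.failcnt"):
        count = read_failcnt(cgroup_dir, name)
        # A missing file is skipped rather than treated as an error.
        if count is not None and count > 0:
            return True
    return False
```

A caller would pass the job or step cgroup directory (the path layout depends on the site configuration) and emit a notification when `oom_suspected` returns True.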
-
- Nov 09, 2013
-
-
Ryan Cox authored
-
- Nov 08, 2013
-
-
Ryan Cox authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
task/cgroup: notify if memory.memsw.failcnt > 0, which indicates an OOM condition. Thanks Ryan, this looks correct.
-
Ryan Cox authored
-
- Nov 07, 2013
-
-
Morris Jette authored
-
- Nov 06, 2013
-
-
Gennaro Oliva authored
-
- Nov 05, 2013
-
-
Morris Jette authored
Correction to hostlist parsing bug introduced in v2.6.4 for hostlists with more than one numeric range in brackets (e.g. "rack[0-3]_blade[0-63]"). Bug 505.
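To illustrate what a hostlist with more than one bracketed numeric range is expected to expand to, here is a small Python sketch. It is not Slurm's hostlist code (which is written in C) and says nothing about how the fix was implemented; the function names `expand_hostlist` and `_expand_range_list` are hypothetical. It assumes a single expression with no top-level comma-separated entries.

```python
import itertools
import re

# Matches one bracketed numeric range list, e.g. "[0-3]" or "[1,4-6]".
_BRACKET = re.compile(r"\[([^\]]+)\]")


def _expand_range_list(spec):
    """Expand "0-3,7" into ["0", "1", "2", "3", "7"], keeping zero padding."""
    values = []
    for part in spec.split(","):
        if "-" in part:
            low, high = part.split("-", 1)
            # Preserve leading zeros, e.g. "01-03" gives "01", "02", "03".
            width = len(low) if low.startswith("0") else 0
            values.extend(str(n).zfill(width) for n in range(int(low), int(high) + 1))
        else:
            values.append(part)
    return values


def expand_hostlist(expr):
    """Expand every bracketed range in expr and return all host names."""
    # With a capture group, re.split alternates literal text and bracket bodies.
    pieces = _BRACKET.split(expr)
    literals = pieces[0::2]
    choices = [_expand_range_list(spec) for spec in pieces[1::2]]
    hosts = []
    for combo in itertools.product(*choices):
        name = literals[0]
        for value, literal in zip(combo, literals[1:]):
            name += value + literal
        hosts.append(name)
    return hosts
```

For the example in the commit message, `expand_hostlist("rack[0-3]_blade[0-63]")` yields 4 x 64 = 256 names, from `rack0_blade0` through `rack3_blade63`.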
-