- Jul 13, 2012
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Conflicts: doc/html/high_throughput.shtml
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Without this change the values known to slurmctld do not change without a cold-start.
-
- Jul 12, 2012
-
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
than 1 midplane but not the entire allocation.
-
Danny Auble authored
multi midplane block allocation.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
where other blocks on an overlapping midplane are running jobs.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- Jul 11, 2012
-
-
Danny Auble authored
-
Danny Auble authored
hardware is marked bad remove the larger block and create a block over just the bad hardware making the other hardware available to run on.
-
Morris Jette authored
-
Danny Auble authored
allocation.
-
Danny Auble authored
-
Danny Auble authored
for a job to finish on it the number of unused cpus wasn't updated correctly.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
same type and network ID. Add logic to match adapter name also. This is needed due to the additional IP_ONLY adapter named virbr0 as used for virtualization.
-
Morris Jette authored
-
Morris Jette authored
-
- Jul 10, 2012
-
-
Danny Auble authored
-
Morris Jette authored
When using the jobcomp/script interface, we have noticed the NODECNT environment variable is off-by-one when logging completed jobs in the NODE_FAIL state (though the NODELIST is correct). This appears to be because in many places in job_completion_logger() is called after deallocate_nodes(), which appears to decrement job->node_cnt for DOWN nodes. If job_completion_logger() only called the job completion plugin, then I would guess that it might be safe to move this call ahead of deallocate_nodes(). However, it seems like job_completion_logger() also does a bunch of accounting stuff (?), so perhaps that would need to be split out first? Also, there is the possibility that this is working as designed, though if so a well placed comment in the code might be appreciated. If the decreased nodecount is intended, though, should the DOWN nodes also be removed from the job's NODELIST? - Mark Grondona
-