- Jul 01, 2011
-
-
Morris Jette authored
Export the symbol s_p_hashtbl_destroy as slurm_s_p_hashtbl_destroy in libslurm so that external programs can link to libslurm and be able to reference select/cray without undefined symbols
-
Morris Jette authored
If the gres/gpu plugin is used, then set the CUDA_VISIBLE_DEVICE environment variable to "NoDevFiles" if the gres.conf file has no device files identified for generic recources (GRES) of type GPU. Otherwise set the device file sequence number(s).
-
- Jun 30, 2011
-
-
Morris Jette authored
-
Morris Jette authored
Enhancements to sched/backfill performance with select/cons_res plugin. Major improvements would be seen with large job counts. Based upon bf_build_row_bitmaps_2.2.6.patch patch from Bjørn-Helge Mevik, University of Oslo.
-
Morris Jette authored
Add a call to acct_policy_job_runnable() to check if a job should have been marked with AssociationResourceLimit. This avoids trying to backfill any job who wouldn't be allowed to run now anyway. Patch bf_acct_policy_check_2.2.6.patch from Bjørn-Helge Mevik, University of Oslo
-
Morris Jette authored
Have salloc log (with verbose option) when the job is suspended or resumed, but do no suspend or resume it's child processes when that occurs. ALPS has been modified to do so and salloc no longer needs to do it.
-
- Jun 29, 2011
-
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
- Jun 28, 2011
-
-
Danny Auble authored
association limit.
-
Danny Auble authored
-
Morris Jette authored
Permit scontrol to update a partition's default and maximum memory limits per job.
-
- Jun 27, 2011
-
-
Morris Jette authored
Add basil_switch function to select/cray libemulate (ALPS eumulation library) to function properly with emulated Cray system.
-
Morris Jette authored
Add default and maximum memory limits on a per-partitiion basis. If not specified, the system-wide memory limits will apply.
-
Morris Jette authored
-
Morris Jette authored
Describe alps.conf changes required for gang scheduling on a cray system
-
Morris Jette authored
Auble and Jette listed with SchedMD rather than LLNL.
-
Morris Jette authored
Update web page header and footer to be generic (not LLNL-centric) and add generic disclaimer web page.
-
Danny Auble authored
-
Danny Auble authored
from how they are in the slurm.conf file but doesn't use the a topology plugin.
-
Morris Jette authored
Change use of pointer in a step launch RPC to a copy of the data structure which will insure that the step data does not change while the RPC message is being built. No problems have been observed, but this will be safer.
-
Danny Auble authored
-
- Jun 25, 2011
-
-
Morris Jette authored
"scontrol show config" was reporting both per CPU and per node memory limits as being per CPU limits. Change to report per node limits with the proper key name.
-
-
Morris Jette authored
Correct values mainted for suspended job count (sus_job_cnt) by node and run job count (job_cnt_run) by front-end node when reconfiguring the slurmctld daemon while there are suspended jobs on a front-end architecture.
-
- Jun 24, 2011
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
scheduling
-
Morris Jette authored
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Add select_jobinfo to the task launch RPC so that all nodes have access to the information and not job the head node. Based upon patch by Andriy Grytsenko (Massive Solutions Limited).
-
Morris Jette authored
-
Danny Auble authored
Needed for gang scheduling.
-
Morris Jette authored
Fix possible invalid memory reference in sched/backfill. Patch by Andriy Grytsenko (Massive Solutions Limited).
-
Morris Jette authored
-
Morris Jette authored
Add flag to the select APIs for job suspend/resume indicating if the action is for gang scheduling or an explicit job suspend/resume by the user. Only an explicit job suspend/resume will reset the job's priority and make resources exclusively held by the job available to other jobs. This change is also needed for Cray systems with ALPS.
-