- Jun 29, 2014
-
-
jette authored
sched/backfill - If a job started with an infinite time limit, set its end_time one year in the future. The previous logic based it upon INFINITE and produced an overflow of the uint32_t time.
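The overflow described in this commit can be illustrated with a minimal sketch. It is not the actual sched/backfill code: `SKETCH_INFINITE`, `SKETCH_ONE_YEAR_SECS`, `naive_limit_secs` and `sketch_end_time` are hypothetical names, and the sketch assumes the time limit is stored in minutes and that the "infinite" sentinel is the largest uint32_t value.

```c
#include <stdint.h>

/* Hypothetical stand-ins for illustration; not Slurm's own definitions. */
#define SKETCH_INFINITE      UINT32_MAX            /* assumed "no time limit" sentinel */
#define SKETCH_ONE_YEAR_SECS (365 * 24 * 60 * 60)  /* 31,536,000 seconds */

/* Naive conversion from minutes to seconds in 32-bit unsigned arithmetic.
 * For the sentinel value the product wraps modulo 2^32. */
static uint32_t naive_limit_secs(uint32_t time_limit_min)
{
    return time_limit_min * 60u;
}

/* Guarded version: treat the sentinel as "one year from start" and do the
 * arithmetic in 64 bits so ordinary limits cannot wrap either. */
static int64_t sketch_end_time(int64_t start, uint32_t time_limit_min)
{
    if (time_limit_min == SKETCH_INFINITE)
        return start + SKETCH_ONE_YEAR_SECS;
    return start + (int64_t)time_limit_min * 60;
}
```

With these assumptions, `naive_limit_secs(SKETCH_INFINITE)` wraps around instead of representing an unbounded limit, while `sketch_end_time(start, SKETCH_INFINITE)` returns a finite time one year after `start`.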
-
- Jun 27, 2014
-
-
David Bigagli authored
-
Morris Jette authored
Report correct CPU count allocated to job when allocated whole node even if not using all CPUs. bug 909
-
David Bigagli authored
-
Morris Jette authored
Rather than relying upon the default shared value being (uint16_t)NO_VAL;
-
Morris Jette authored
Don't log the job's incoming nice value if it is NO_VAL - NICE_OFFSET
-
Morris Jette authored
A function was added in commit 7cae0abb that was essentially identical to a previously existing function. The redundant function has been removed.
-
Morris Jette authored
Fix bug which could report to squeue the wrong partition for a running job that is submitted to multiple partitions.
-
- Jun 26, 2014
-
-
Danny Auble authored
-
Danny Auble authored
you could get multiple '*' wckeys.
-
Danny Auble authored
deleting users.
-
Danny Auble authored
able to be destroyed for debug.
-
Morris Jette authored
For each job the backfill scheduling is testing, print not only the job ID, but also its priority and partition.
-
Morris Jette authored
Correction to commit aeecd03c for some configurations
-
Morris Jette authored
Change the log format to match that of slurm.conf
-
Morris Jette authored
Clarify the meaning of the AllowGroups configuration parameter a bit more.
-
Morris Jette authored
Original version did not end an indent properly
-
Morris Jette authored
Remove some debugging logic accidentally included in commit 7a187f10
-
Morris Jette authored
sched/backfill: Correct job start time estimate with advanced reservations. Under some conditions, the job start time would be set to the end time of an advanced reservation that may have nothing to do with that job's resource requirements. bug 911
-
- Jun 25, 2014
-
-
Morris Jette authored
-
Morris Jette authored
Log more backfill scheduling actions, including where and when jobs will run even if they cannot start until long in the future
-
Morris Jette authored
-
Danny Auble authored
-
Remi Palancher authored
file.
-
Bill Brophy authored
If we don't it could get messed up on a reconfig.
-
Danny Auble authored
-
Danny Auble authored
-
David Bigagli authored
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Logic used to identify cores which are usable with each GRES was incorrect. bug 905
-
Danny Auble authored
QOS had a GrpNodes set.
-
Danny Auble authored
-
- Jun 24, 2014
-
-
Morris Jette authored
Fix for core-based advanced reservations where the distribution of cores across nodes is not even. Failing test case: the system has 10 nodes, 1 of which is fully occupied; creating a reservation with 9 nodes and 10 cores would always fail with a "busy nodes" error.
-
Morris Jette authored
-
- Jun 20, 2014
-
-
Morris Jette authored
The hostname was being set as an HDF5 value before the field was set in slurmstepd, resulting in SEGV. This change sets hostname before the HDF5 call and also tests for NULL before trying to set the value.
Backtrace of failure:
(gdb) bt
#0  strlen () at ../sysdeps/x86_64/strlen.S:106
#1  0x00007f4af5bb756d in put_string_attribute (parent=33554432, name=0x7f4af5bb8591 "Node Name", value=0x0) at src/plugins/acct_gather_profile/hdf5/hdf5_api.c:1711
#2  0x00007f4af5bad224 in acct_gather_profile_p_node_step_start (job=0x194cc80) at src/plugins/acct_gather_profile/hdf5/acct_gather_profile_hdf5.c:372
#3  0x000000000052ca86 in acct_gather_profile_g_conf_set (tbl=0x194cc80) at src/common/slurm_acct_gather_profile.c:490
#4  0x000000000042e028 in batch_stepd_step_rec_create (msg=0x194d280) at src/slurmd/slurmstepd/slurmstepd_job.c:496
#5  0x0000000000426ae5 in mgr_launch_batch_job_setup (msg=0x194d280, cli=0x194bec0) at src/slurmd/slurmstepd/mgr.c:422
#6  0x00000000004263da in _step_setup (cli=0x194bec0, self=0x0, msg=0x194bd90) at slurmd/slurmstepd/slurmstepd.c:516
#7  0x0000000000424302 in main (argc=1, argv=0x7fff1f7c6c98) at src/slurmd/slurmstepd/slurmstepd.c:127
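The NULL test mentioned in this commit can be sketched as follows. This is a hypothetical helper, not the real `put_string_attribute` from the HDF5 plugin: `sketch_put_string` and its parameters are assumptions that only show the general pattern of rejecting a NULL string before it reaches `strlen`.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: copy `value` into `dst`, truncating if needed.
 * Returns the number of characters stored, or -1 when an argument is
 * missing, so a NULL value never reaches strlen(). */
static int sketch_put_string(char *dst, size_t dst_size, const char *value)
{
    size_t len;

    if (dst == NULL || dst_size == 0 || value == NULL)
        return -1;               /* the guard that avoids the SEGV */

    len = strlen(value);
    if (len >= dst_size)
        len = dst_size - 1;      /* leave room for the terminator */
    memcpy(dst, value, len);
    dst[len] = '\0';
    return (int)len;
}
```

A caller that passes a not-yet-initialized (NULL) hostname gets an error return it can check, rather than a crash inside `strlen`.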
-
Matthieu Hautreux authored
-
- Jun 19, 2014
-
-
David Bigagli authored
errno before each call to the API.
-
jette authored
Correct Shared field in job state information seen by scontrol, sview, etc.
-