- Jun 25, 2014
-
-
Morris Jette authored
-
Morris Jette authored
Log more backfill scheduling actions, including where and when jobs will run, even if they cannot start until long in the future
-
Morris Jette authored
-
Danny Auble authored
-
Remi Palancher authored
file.
-
Bill Brophy authored
If we don't it could get messed up on a reconfig.
-
Danny Auble authored
-
Danny Auble authored
-
David Bigagli authored
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Logic used to identify cores which are usable with each GRES was incorrect. bug 905
-
Danny Auble authored
QOS had a GrpNodes set.
-
Danny Auble authored
-
- Jun 24, 2014
-
-
Morris Jette authored
Fix for core-based advanced reservations where the distribution of cores across nodes is not even. Failing test case: on a system with 10 nodes, 1 of which is fully occupied, creating a reservation with 9 nodes and 10 cores would always fail with a "busy nodes" error.
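The failing case above can be illustrated with a small sketch. This is not Slurm's reservation code; the function name `pick_reservation_cores` and its parameters are hypothetical, and the selection policy (prefer nodes with the most free cores, one core per node first, then top up) is an assumption chosen only to show how an uneven request such as 9 nodes and 10 cores can be satisfied when one node is fully occupied.

```c
/*
 * Hypothetical sketch, not Slurm code: choose cores for a reservation
 * that asks for want_nodes nodes and want_cores cores in total.
 * free_cores[i] is the number of idle cores on node i.
 * On success returns 0 and picked[i] holds the cores taken from node i.
 * On failure returns -1 and the contents of picked[] are unspecified.
 */
int pick_reservation_cores(const int *free_cores, int node_cnt,
                           int want_nodes, int want_cores, int *picked)
{
	int i, n, cores = 0;

	/* Every reserved node contributes at least one core. */
	if (want_nodes <= 0 || want_nodes > node_cnt || want_cores < want_nodes)
		return -1;
	for (i = 0; i < node_cnt; i++)
		picked[i] = 0;

	/* Pass 1: take one core from each of the want_nodes nodes that
	 * have the most free cores; fully occupied nodes are skipped. */
	for (n = 0; n < want_nodes; n++) {
		int best = -1;
		for (i = 0; i < node_cnt; i++) {
			if (picked[i] == 0 && free_cores[i] > 0 &&
			    (best < 0 || free_cores[i] > free_cores[best]))
				best = i;
		}
		if (best < 0)
			return -1;	/* not enough nodes with a free core */
		picked[best] = 1;
		cores++;
	}

	/* Pass 2: hand out the remaining cores unevenly, using spare
	 * capacity on the nodes that were already selected. */
	for (i = 0; i < node_cnt && cores < want_cores; i++) {
		while (picked[i] > 0 && picked[i] < free_cores[i] &&
		       cores < want_cores) {
			picked[i]++;
			cores++;
		}
	}
	return (cores == want_cores) ? 0 : -1;
}
```

With ten nodes of which the first has no free cores, a request for 9 nodes and 10 cores succeeds in this sketch: eight nodes contribute one core each and one node contributes two.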
-
Morris Jette authored
-
- Jun 20, 2014
-
-
Morris Jette authored
The hostname was being set as an HDF5 value before the field was set in slurmstepd, resulting in SEGV. This change sets hostname before the HDF5 call and also tests for NULL before trying to set the value.
Backtrace of failure:
(gdb) bt
0 strlen () at ../sysdeps/x86_64/strlen.S:106
1 0x00007f4af5bb756d in put_string_attribute (parent=33554432, name=0x7f4af5bb8591 "Node Name", value=0x0) at src/plugins/acct_gather_profile/hdf5/hdf5_api.c:1711
2 0x00007f4af5bad224 in acct_gather_profile_p_node_step_start (job=0x194cc80) at src/plugins/acct_gather_profile/hdf5/acct_gather_profile_hdf5.c:372
3 0x000000000052ca86 in acct_gather_profile_g_conf_set (tbl=0x194cc80) at src/common/slurm_acct_gather_profile.c:490
4 0x000000000042e028 in batch_stepd_step_rec_create (msg=0x194d280) at src/slurmd/slurmstepd/slurmstepd_job.c:496
5 0x0000000000426ae5 in mgr_launch_batch_job_setup (msg=0x194d280, cli=0x194bec0) at src/slurmd/slurmstepd/mgr.c:422
6 0x00000000004263da in _step_setup (cli=0x194bec0, self=0x0, msg=0x194bd90) at slurmd/slurmstepd/slurmstepd.c:516
7 0x0000000000424302 in main (argc=1, argv=0x7fff1f7c6c98) at src/slurmd/slurmstepd/slurmstepd.c:127
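The NULL test described in this commit can be sketched as follows. This is an illustration only, not the code in hdf5_api.c: the function `put_string_attribute_checked` and its buffer-based signature are hypothetical stand-ins that show the guard placed ahead of `strlen()`, the call that crashed in frame 0 of the backtrace when `value` was 0x0.

```c
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical stand-in for an attribute writer (not the Slurm/HDF5 API):
 * copies value into dst as a NUL-terminated string.
 * Returns 0 on success, -1 if an argument is unusable or value does not fit.
 */
int put_string_attribute_checked(char *dst, size_t dst_size, const char *value)
{
	size_t len;

	if (dst == NULL || dst_size == 0)
		return -1;
	if (value == NULL) {
		/* The guard: never hand a NULL pointer to strlen(). */
		dst[0] = '\0';
		return -1;
	}
	len = strlen(value);
	if (len >= dst_size) {
		/* No room for the string plus its terminator. */
		dst[0] = '\0';
		return -1;
	}
	memcpy(dst, value, len + 1);	/* copy including the terminator */
	return 0;
}
```

A caller that has not yet filled in the hostname would get -1 back and an empty attribute instead of a segmentation fault.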
-
Matthieu Hautreux authored
-
- Jun 19, 2014
-
-
David Bigagli authored
errno before each call to the API.
-
jette authored
Correct Shared field in job state information seen by scontrol, sview, etc.
-
jette authored
Only for pending jobs. bug 899
-
Danny Auble authored
allocation and CR_PACK_NODES is set, lay out tasks appropriately. This is related to bug 890
-
Danny Auble authored
-
- Jun 18, 2014
-
-
David Bigagli authored
the sinfo command is parallelized for performance reasons and it really cannot be completely parallelized for some use cases. See bug 883
-
Morris Jette authored
-
- Jun 17, 2014
-
-
Morris Jette authored
Correct logic to support Power7 processor with 1 or 2 threads per core (CPU IDs are not consecutive). bug 891
-
Morris Jette authored
There was some logic copied from bitstring.c and an unused variable problem reported by the compiler.
-
Morris Jette authored
-
Morris Jette authored
This is due to a bug introduced in commit 83d626ca. Some configurations could result in NULL names in the node list table (e.g. hidden partitions).
-
Morris Jette authored
Slowness introduced in commit 83d626ca
-
Morris Jette authored
This reverts commit 0d6a9965. That patch would permit a job with shared resources to run on the same node as a job without shared resources. Unfortunately, it let those jobs share CPUs. Finer-grained sharing might be possible with extensive code changes, but it is not something to work on now.
-
https://github.com/SchedMD/slurm
jette authored
-
jette authored
Without this change, the job's --shared option when used with a partition configuration of Shared=YES was not being honored by the select/cons_res or select/serial plugin.
-
jette authored
Original code was implicitly setting a job's shared field to 1 for select/cons_res.
-
Danny Auble authored
-
David Bigagli authored
-
ggeorgakoudis authored
display larger sharing values.
-
David Bigagli authored
-