- Jun 25, 2014
-
-
Danny Auble authored
QOS had a GrpNodes set.
-
Danny Auble authored
-
- Jun 24, 2014
-
-
Morris Jette authored
Fix for core-based advanced reservations where the distribution of cores across nodes is not even. Failing test case: the system has 10 nodes, 1 of which is fully occupied; creating a reservation with 9 nodes and 10 cores would always fail with a "busy nodes" error.
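The failing case can be illustrated with a small sketch. This is not Slurm's reservation code: the function name `pick_cores`, its inputs, and the greedy strategy are all assumptions made for illustration. It picks the requested number of nodes from those with free cores, gives each one core, then spreads the remaining cores over whatever capacity is left, so an uneven core distribution does not by itself cause a failure.

```python
def pick_cores(free_cores, node_cnt, core_cnt):
    """Return {node_index: cores} or None if the request cannot be met.

    free_cores: free core count per node (hypothetical input format).
    """
    if core_cnt < node_cnt:
        return None  # assumption: every selected node needs at least one core
    # Only nodes with at least one free core are usable; prefer the emptiest.
    usable = sorted(
        (i for i, free in enumerate(free_cores) if free > 0),
        key=lambda i: free_cores[i],
        reverse=True,
    )
    if len(usable) < node_cnt:
        return None
    chosen = usable[:node_cnt]
    if sum(free_cores[i] for i in chosen) < core_cnt:
        return None
    # One core per node first, then hand out the remainder unevenly.
    alloc = {i: 1 for i in chosen}
    remaining = core_cnt - node_cnt
    for i in chosen:
        extra = min(remaining, free_cores[i] - 1)
        alloc[i] += extra
        remaining -= extra
    return alloc


# 10 nodes with 8 cores each (an assumed size); node 0 is fully occupied.
free = [0] + [8] * 9
result = pick_cores(free, node_cnt=9, core_cnt=10)
```

With these inputs the fully occupied node is skipped and one of the nine remaining nodes receives two cores, which is the uneven layout the commit message describes.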
-
Morris Jette authored
-
- Jun 20, 2014
-
-
Morris Jette authored
The hostname was being set as an HDF5 value before the field was set in slurmstepd, resulting in SEGV. This change sets hostname before the HDF5 call and also tests for NULL before trying to set the value.
Backtrace of failure:
(gdb) bt
#0 strlen () at ../sysdeps/x86_64/strlen.S:106
#1 0x00007f4af5bb756d in put_string_attribute (parent=33554432, name=0x7f4af5bb8591 "Node Name", value=0x0) at src/plugins/acct_gather_profile/hdf5/hdf5_api.c:1711
#2 0x00007f4af5bad224 in acct_gather_profile_p_node_step_start (job=0x194cc80) at src/plugins/acct_gather_profile/hdf5/acct_gather_profile_hdf5.c:372
#3 0x000000000052ca86 in acct_gather_profile_g_conf_set (tbl=0x194cc80) at src/common/slurm_acct_gather_profile.c:490
#4 0x000000000042e028 in batch_stepd_step_rec_create (msg=0x194d280) at src/slurmd/slurmstepd/slurmstepd_job.c:496
#5 0x0000000000426ae5 in mgr_launch_batch_job_setup (msg=0x194d280, cli=0x194bec0) at src/slurmd/slurmstepd/mgr.c:422
#6 0x00000000004263da in _step_setup (cli=0x194bec0, self=0x0, msg=0x194bd90) at slurmd/slurmstepd/slurmstepd.c:516
#7 0x0000000000424302 in main (argc=1, argv=0x7fff1f7c6c98) at src/slurmd/slurmstepd/slurmstepd.c:127
-
Matthieu Hautreux authored
-
- Jun 19, 2014
-
-
David Bigagli authored
errno before each call to the API.
-
jette authored
Correct Shared field in job state information seen by scontrol, sview, etc.
-
jette authored
Only for pending jobs. Bug 899.
-
Danny Auble authored
allocation and CR_PACK_NODES is set, lay out tasks appropriately. This is related to bug 890.
-
Danny Auble authored
-
- Jun 18, 2014
-
-
David Bigagli authored
The sinfo command is parallelized for performance reasons, and it really cannot be completely parallelized for some use cases. See bug 883.
-
Morris Jette authored
-
- Jun 17, 2014
-
-
Morris Jette authored
Correct logic to support Power7 processor with 1 or 2 threads per core (CPU IDs are not consecutive). bug 891
-
Morris Jette authored
There was some logic copied from bitstring.c and an unused variable problem reported by the compiler.
-
Morris Jette authored
-
Morris Jette authored
This is due to a bug introduced in commit 83d626ca. Some configurations could result in NULL names in the node list table (e.g. hidden partitions).
-
Morris Jette authored
Slowness introduced in commit 83d626ca
-
Morris Jette authored
This reverts commit 0d6a9965. That patch would permit a job with shared resources to run on the same node as a job without shared resources; unfortunately, it let those jobs share CPUs. Finer-grained sharing might be possible with extensive code changes, but it is not something to work on now.
-
https://github.com/SchedMD/slurm
jette authored
-
jette authored
Without this change, the job's --shared option when used with a partition configuration of Shared=YES was not being honored by the select/cons_res or select/serial plugin.
-
jette authored
Original code was implicitly setting a job's shared field to 1 for select/cons_res.
-
Danny Auble authored
-
David Bigagli authored
-
ggeorgakoudis authored
display larger sharing values.
-
David Bigagli authored
-
Morris Jette authored
cores specialization test 17.34 failed with bad assignment logic
-
- Jun 16, 2014
-
-
Morris Jette authored
-
Morris Jette authored
-
- Jun 14, 2014
-
-
jette authored
If FastSchedule=0 is configured and some nodes have not registered for service (so we do not know their actual resource counts), then leave the job pending rather than rejecting it without knowing if it can run later (when the node registers and we know its specs). bug 872
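The decision described in this entry can be sketched as below. This is a simplified illustration, not Slurm's scheduler logic: the function `admit_job`, the node dictionaries with `cpus` and `registered` keys, and the three result strings are hypothetical names. The request is reduced to a single per-node CPU requirement.

```python
def admit_job(required_cpus, nodes, fast_schedule):
    """Return "RUNNABLE", "PENDING", or "REJECTED" for a job request.

    nodes: list of dicts with hypothetical keys "cpus" (int) and
    "registered" (bool). With fast_schedule == 0, only registered nodes
    have a trustworthy CPU count.
    """
    known_max = max(
        (n["cpus"] for n in nodes if n["registered"] or fast_schedule != 0),
        default=0,
    )
    if known_max >= required_cpus:
        return "RUNNABLE"
    # Unregistered nodes might turn out to be large enough, so do not reject.
    unknown = fast_schedule == 0 and any(not n["registered"] for n in nodes)
    return "PENDING" if unknown else "REJECTED"


nodes = [
    {"cpus": 4, "registered": True},
    {"cpus": 0, "registered": False},  # real size unknown until it registers
]
state = admit_job(8, nodes, fast_schedule=0)
```

Here the job stays pending because one node has not yet reported its resources; once every node is registered and none is large enough, the same request would be rejected.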
-
jette authored
-
- Jun 13, 2014
- Jun 12, 2014
-
-
Morris Jette authored
For "scontrol --details show job" report the correct CPU_IDs when there are multiple threads per core (we are translating a core bitmap to CPU IDs). This is an enhancement of commit 83d626ca so the node table is only loaded once for the entire job table. bug 850
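The translation from a core bitmap to CPU IDs can be sketched as follows. This is an illustration under stated assumptions, not the code from the commit: the function names are hypothetical, the bitmap is a plain list of 0/1 values, and thread t of core c is assumed to have CPU ID c * threads_per_core + t. Hardware where CPU IDs are not consecutive would need a lookup table instead. The range-string output format is also an assumption.

```python
def core_bitmap_to_cpu_ids(core_bitmap, threads_per_core):
    """Expand allocated cores into CPU IDs, assuming consecutive numbering."""
    cpu_ids = []
    for core, allocated in enumerate(core_bitmap):
        if allocated:
            first = core * threads_per_core
            cpu_ids.extend(range(first, first + threads_per_core))
    return cpu_ids


def format_ranges(ids):
    """Collapse sorted IDs into a range string such as "0-3,6-7"."""
    parts = []
    start = prev = None
    for i in ids:
        if start is None:
            start = prev = i
        elif i == prev + 1:
            prev = i
        else:
            parts.append(f"{start}-{prev}" if start != prev else str(start))
            start = prev = i
    if start is not None:
        parts.append(f"{start}-{prev}" if start != prev else str(start))
    return ",".join(parts)


# Cores 0, 1 and 3 allocated, two threads per core.
cpu_ids = core_bitmap_to_cpu_ids([1, 1, 0, 1], threads_per_core=2)
cpu_id_str = format_ranges(cpu_ids)
```

Reporting core indices directly would undercount in this case: three allocated cores correspond to six CPU IDs when each core has two threads.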
-
Martin Perry authored
Correct the record of CPU_IDs allocated to a job if there is more than one CPU per core.
-
Morris Jette authored
If job requests --exclusive then do not use nodes which have any cores in an advanced reservation. Also prevents case where nodes can be shared by other jobs.
-
Morris Jette authored
Disable some logging that would be very slow unless the _DEBUG flag is set in the plugin
-
Morris Jette authored
If job requests --exclusive then do not use nodes which have any cores in an advanced reservation. Previously the job would be allocated all of the cores outside of the advanced reservation.
-
Morris Jette authored
Correct support for partition with Shared=YES configuration. Previous logic would share resources for jobs by default (i.e. if user did not explicitly request --exclusive). bug 758
-