  1. Apr 11, 2016
    • Morris Jette's avatar
backfill - minor performance enhancements · 395b5505
      Morris Jette authored
The gprof tool shows most time being consumed by the bit_test()
  function as called from the select plugin, which in turn was called
  by the backfill scheduler. These changes replace the for-loop end-points.
  Previous logic tested all possible nodes. The new logic identifies
  the first and last bit set in the node bitmap and uses those end-points
  instead. Note that the logic to find the first and last bits set starts
  with a word-based search (testing for a 64-bit zero value rather than
  testing each individual bit). The net result is a small performance
  improvement.
      bug 2588
      395b5505
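The word-based search described in the commit message can be sketched as follows. This is a minimal illustration, not Slurm's actual bitstring code: the function names and the flat `uint64_t` array layout are assumptions made for the example.

```c
#include <assert.h>
#include <stdint.h>
#include <stddef.h>

/* Hypothetical sketch of the word-based scan: skip whole 64-bit words
 * that are zero, then test individual bits only inside the first (or
 * last) non-zero word.  Slurm's real bitstr_t API differs. */

/* Return the index of the first set bit, or -1 if the map is empty. */
static int first_set_bit(const uint64_t *map, size_t nwords)
{
    for (size_t w = 0; w < nwords; w++) {
        if (map[w] == 0)
            continue;                   /* word-at-a-time skip */
        for (int b = 0; b < 64; b++)
            if (map[w] & ((uint64_t)1 << b))
                return (int)(w * 64 + b);
    }
    return -1;
}

/* Return the index of the last set bit, or -1 if the map is empty. */
static int last_set_bit(const uint64_t *map, size_t nwords)
{
    for (size_t w = nwords; w-- > 0; ) {
        if (map[w] == 0)
            continue;
        for (int b = 63; b >= 0; b--)
            if (map[w] & ((uint64_t)1 << b))
                return (int)(w * 64 + b);
    }
    return -1;
}
```

With these end-points, the caller's loop runs from `first_set_bit()` to `last_set_bit()` instead of over every possible node index, which is where the small performance gain comes from.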
    • Tim Wickberg's avatar
      Fix three typos. · e6e87c92
      Tim Wickberg authored
      e6e87c92
    • Morris Jette's avatar
      burst_buffer/cray fix for pre_run fail · 8f667db4
      Morris Jette authored
      burst_buffer/cray - Decrement job's prolog_running counter if pre_run fails.
      bug 2621
      8f667db4
    • Morris Jette's avatar
      Reset job's prolog_running counter · f3f41e10
      Morris Jette authored
      If a job is no longer in configuring state, then clear the prolog_running
        counter on slurmctld restart or reconfigure.
      bug 2621
      f3f41e10
  2. Apr 09, 2016
    • Morris Jette's avatar
      Fix for commit e62a9270 · 06776b12
      Morris Jette authored
For the case where a job can't start and there are no running jobs
to remove in order to establish an estimated start time.
      06776b12
    • Morris Jette's avatar
      backfill scheduling enhancement · e62a9270
      Morris Jette authored
When determining when a pending job will be able to start, rather
  than removing each running job one at a time and retesting whether
  the pending job can be scheduled, remove multiple jobs that all end
  at about the same time before testing. This reduces the number of
  calls to the job placement logic, which is time consuming.
      e62a9270
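The batching idea in the commit above can be illustrated with a small sketch. The function name, the sorted end-time array, and the grouping window are all assumptions for this example, not Slurm's actual backfill code: jobs whose end times fall within the same window are removed together, so the expensive placement test runs once per batch instead of once per job.

```c
#include <assert.h>
#include <stddef.h>
#include <time.h>

/* Count how many placement tests are needed when running jobs whose
 * end times fall within `window` seconds of each other are removed as
 * one batch.  `end_times` must be sorted ascending. */
static int placement_tests_needed(const time_t *end_times, size_t n,
                                  time_t window)
{
    if (n == 0)
        return 0;
    int tests = 1;                       /* first batch */
    time_t batch_start = end_times[0];
    for (size_t i = 1; i < n; i++) {
        if (end_times[i] - batch_start > window) {
            tests++;                     /* new batch => one more test */
            batch_start = end_times[i];
        }
    }
    return tests;
}
```

For six jobs ending at roughly three distinct times, this needs three placement tests rather than six, which is the saving the commit describes.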
  3. Apr 08, 2016
  4. Apr 07, 2016
  5. Apr 06, 2016
  6. Apr 05, 2016
    • Morris Jette's avatar
      Fix backfill scheduler race condition · d8b18ff8
      Morris Jette authored
      Fix backfill scheduler race condition that could cause invalid pointer in
          select/cons_res plugin. Bug introduced in 15.08.9, commit:
          efd9d35e
      
The scenario is as follows:
      1. Backfill scheduler is running, then releases locks
      2. Main scheduling loop starts a job "A"
3. Backfill scheduler resumes, finds job "A" in its queue and
   resets its partition pointer.
      4. Job "A" completes and tries to remove resource allocation record
         from select/cons_res data structure, but fails to find it because
         it is looking in the table for the wrong partition.
      5. Job "A" record gets purged from slurmctld
      6. Select/cons_res plugin attempts to operate on resource allocation
         data structure, finds pointer into the now purged data structure
         of job "A" and aborts or gets SEGV
      Bug 2603
      d8b18ff8
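The hazard in steps 1-3 above is acting on a record cached from before the locks were dropped. A minimal sketch of the defensive pattern, with hypothetical names (not slurmctld's actual structures): after reacquiring the locks, re-check the job's state before writing to the cached record, since the main scheduler may have started the job in the meantime.

```c
#include <assert.h>
#include <string.h>

/* Illustrative job record; fields and names are hypothetical. */
enum job_state { PENDING, RUNNING, COMPLETE };

struct job_record {
    int job_id;
    enum job_state state;
    const char *partition;
};

/* Called by the backfill thread after it reacquires the locks.
 * Returns 0 if the update was applied, -1 if the cached entry was
 * stale (the job already started) and was left untouched. */
static int backfill_update_partition(struct job_record *cached,
                                     const char *new_part)
{
    if (cached->state != PENDING)
        return -1;              /* stale entry: skip, don't clobber */
    cached->partition = new_part;
    return 0;
}
```

Skipping the stale entry prevents the later lookup in step 4 from searching the wrong partition's table.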
    • Danny Auble's avatar
      Remove debug from commit 921c59e4 · 24566dd7
      Danny Auble authored
      24566dd7
  7. Apr 04, 2016
  8. Apr 02, 2016
  9. Apr 01, 2016
  10. Mar 31, 2016
  11. Mar 30, 2016
  12. Mar 28, 2016
    • Morris Jette's avatar
      task/cgroup - Fix task binding to CPUs bug · ddf6d9a4
      Morris Jette authored
There was a subtle bug in how tasks were bound to CPUs which could result
in an "infinite loop" error. The problem was that various socket/core/thread
calculations were based upon the resources allocated to a step rather than
all resources on the node, and rounding errors could occur. Consider for
      example a node with 2 sockets, 6 cores per socket and 2 threads per core.
      On the idle node, a job requesting 14 CPUs is submitted. That job would
be allocated 4 cores on the first socket and 3 cores on the second socket.
      The old logic would get the number of sockets for the job at 2 and the
      number of cores at 7, then calculate the number of cores per socket at
7/2 or 3 (rounding down to an integer). The logic laying out tasks
      would bind the first 3 cores on each socket to the job then not find any
      remaining cores, report the "infinite loop" error to the user, and run
      the job without one of the expected cores. The problem gets even worse
      when there are some allocated cores on a node. In a more extreme case,
      a job might be allocated 6 cores on one socket and 1 core on a second
      socket. In that case, 3 of that job's cores would be unused.
      bug 2502
      ddf6d9a4
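The rounding error from the commit message can be worked through in a few lines. This is a hypothetical simplification of the old math, not the actual task/cgroup code: averaging allocated cores across sockets with integer division, then binding at most that many cores per socket, loses cores whenever the allocation is uneven.

```c
#include <assert.h>

/* Simplified model of the old per-socket averaging.  `alloc[s]` is
 * the number of cores actually allocated to the job on socket s.
 * Returns how many cores the old logic would bind. */
static int cores_bound_old_logic(const int *alloc, int sockets)
{
    int total = 0;
    for (int s = 0; s < sockets; s++)
        total += alloc[s];

    /* Old logic: average cores over sockets, rounding down. */
    int per_socket = total / sockets;   /* e.g. 7 / 2 == 3 */

    int bound = 0;
    for (int s = 0; s < sockets; s++)   /* bind at most per_socket
                                         * cores on each socket */
        bound += (alloc[s] < per_socket) ? alloc[s] : per_socket;
    return bound;
}
```

For the 4+3 allocation this binds only 6 of 7 cores (one lost), and for the more extreme 6+1 allocation only 4 of 7 (three unused), matching both cases described above.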
    • Morris Jette's avatar
      Fix for srun signal handling threading problem · c8d36dba
      Morris Jette authored
      This is a revision to commit 1ed38f26
The root problem is that a pthread is passed an argument which is
a pointer to a variable on the stack. If that variable is over-written,
the signal number received will be garbage, and that bad signal
number will be interpreted by srun to possibly abort the request.
      c8d36dba
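The general hazard described above, and one common fix, can be sketched as follows. This is not srun's actual code; the function names are illustrative. Passing the address of a stack variable to a thread is unsafe if the variable can be overwritten before the thread reads it, so each thread gets its own heap-allocated copy of the value.

```c
#include <assert.h>
#include <pthread.h>
#include <stdlib.h>

static int g_last_sig;              /* signal observed by the thread */

static void *sig_handler_thread(void *arg)
{
    int sig = *(int *)arg;          /* safe: arg is our private copy */
    free(arg);
    g_last_sig = sig;
    return NULL;
}

/* Spawn the handler thread with a heap-allocated copy of signo, so
 * the value cannot be clobbered when the caller's stack changes. */
static int deliver_signal(int signo)
{
    int *copy = malloc(sizeof(*copy));
    pthread_t tid;

    if (copy == NULL)
        return -1;
    *copy = signo;
    if (pthread_create(&tid, NULL, sig_handler_thread, copy) != 0) {
        free(copy);
        return -1;
    }
    return pthread_join(tid, NULL); /* 0 on success */
}
```

The thread, not the spawner, frees the copy, so the value stays valid however long the thread takes to run.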
  13. Mar 26, 2016
    • Morris Jette's avatar
      Revert commit efa83a02 · c1dde86c
      Morris Jette authored
      The previous commit obviously fixed a problem, but introduced a different
      set of problems. This will be pursued later, perhaps in version 16.05.
      c1dde86c
  14. Mar 25, 2016
    • Morris Jette's avatar
      Revert commit 6c14b969 · f5920b77
      Morris Jette authored
      With some configurations and systems, errors of the following sort were
occurring:
      task/cgroup: task[1] infinite loop broken while trying to provision compute elements using block
      task/cgroup: task[1] unable to set taskset '0x0'
      f5920b77
    • Morris Jette's avatar
      burst_buffer/cray - pre-run fail fix · 5a48207e
      Morris Jette authored
      burst_buffer/cray - If the pre-run operation fails then don't issue
          duplicate job cancel/requeue unless the job is still in run state. Prevents
          jobs hung in COMPLETING state.
      bug 2587
      5a48207e
  15. Mar 24, 2016
    • Morris Jette's avatar
      Select/cray - Log NHC run time on "scontrol reconfig" · 58627d02
      Morris Jette authored
      Running "scontrol reconfig" releases resources for jobs waiting for
        the completion of Node Health Check so that other jobs can run.
        Cray says to always wait for NHC to complete, but in extreme
        cases that can be 2 hours, during which the entire resource
        allocation for a job may be unusable. Per advice from NERSC,
        the logic to release resources is unchanged, but logging is
        added here.
      58627d02