- Oct 29, 2012
Morris Jette authored
Anyhow, after applying the patch, I was still running into the same difficulty. Upon a closer look, I saw that I was still receiving the ALPS backend error in the slurmctld.log file. When I examined the code pertaining to this and ran some SLURM-independent tests, I found that we were executing the do_basil_confirm function multiple times in the cases where it would fail. My independent tests show precisely the same behaviour: if you make a reservation request, successfully confirm it, and then attempt to confirm it again, you receive this error message. However, the "apstat -rvv" command shows that the ALPS reservation is fine, so I concluded that this particular ALPS/BASIL message is more informational than a show-stopper. In other words, I can consider the node ready at this point.

As a simple workaround, I currently just inserted an if-block immediately after the call to "basil_confirm" in function "do_basil_confirm" in ".../src/plugins/select/cray/basil_interface.c". The if-statement checks for "BE_BACKEND"; if this is the result, it prints an informational message to slurmctld.log and sets the variable rc=0 so that we can consider the node ready. This now allows my prolog scripts to run, and I can clearly see the SLURM message that I had placed in that if-block.

However, I am not certain we really should just allow this error code to pass through, as it seems like a fairly generic code and there could be various other causes of it that we would not wish to let pass. I really only want to limit the number of calls to basil_confirm to one. Perhaps I could add a field to the job_record so that I can mark whether the ALPS reservation has been confirmed or not.
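A minimal sketch of the workaround described above (not the actual patch): the local variables and the exact basil_confirm signature are assumptions here; only the BE_BACKEND check and the rc=0 fallback come from the description.

    /* Inside do_basil_confirm() in
     * .../src/plugins/select/cray/basil_interface.c -- sketch only;
     * resv_id, job_id and pagg_id stand in for the real locals. */
    rc = basil_confirm(resv_id, job_id, pagg_id);
    if (rc == BE_BACKEND) {
        /* apstat -rvv shows the reservation is fine, so treat the
         * backend error on re-confirmation as informational. */
        info("ALPS reservation for job %u already confirmed "
             "(BE_BACKEND); considering node ready", job_id);
        rc = 0;   /* node can be considered ready */
    }

A cleaner long-term fix, as noted above, would be a flag in the job_record marking the reservation as confirmed, so do_basil_confirm is not re-run at all.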
-
- Oct 26, 2012
Danny Auble authored
Instead of putting 0's into the database, we put NO_VALs and have sacct figure out that jobacct_gather wasn't used.
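A minimal sketch of the idea, assuming nothing about the real schema: NO_VAL is SLURM's "no value" sentinel, and storing it instead of 0 lets a reader distinguish "accounting was off" from a genuine zero. The helper name and fields below are illustrative only.

    #include <stdbool.h>
    #include <stdint.h>

    #define NO_VAL (0xfffffffeU)   /* as defined in slurm/slurm.h */

    /* Hypothetical helper: value to write into the accounting record. */
    static uint32_t stored_max_rss(bool jobacct_gather_used,
                                   uint32_t measured_rss)
    {
        /* 0 would be ambiguous; NO_VAL tells sacct that
         * jobacct_gather simply wasn't running. */
        return jobacct_gather_used ? measured_rss : NO_VAL;
    }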
-
Olli-Pekka Lehto authored
-
- Oct 25, 2012
Morris Jette authored
Incorrect error codes were returned in some cases, especially if the slurmdbd is down.
-
Morris Jette authored
-
- Oct 24, 2012
Morris Jette authored
Previously, for Linux systems, all information was placed on a single line.
-
- Oct 23, 2012
Danny Auble authored
interface instead of through the normal method.
-
- Oct 22, 2012
Danny Auble authored
-
Morris Jette authored
-
Don Albert authored
-
- Oct 19, 2012
Morris Jette authored
-
- Oct 18, 2012
Danny Auble authored
for passthrough gets removed on a dynamic system.
-
Danny Auble authored
in error for passthrough.
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
user's pending allocation was started with srun, and then for some reason the slurmctld was brought down; while it was down, the srun was removed.
-
Danny Auble authored
This is needed for when a free request is added to a block while jobs are still finishing up, so that we don't start new jobs on the block, since they would fail on start. A sketch of the implied guard follows.
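The types and names below are made up for illustration, not the actual BlueGene plugin internals:

    #include <stdbool.h>

    /* Illustrative block state: once a free request is pending,
     * refuse new jobs even while earlier ones are still draining,
     * because anything started there would fail at launch. */
    typedef struct {
        bool free_requested;   /* block has a pending free request */
        int  jobs_finishing;   /* jobs still completing on the block */
    } block_state_t;

    static bool block_accepts_new_jobs(const block_state_t *block)
    {
        return !block->free_requested;
    }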
-
- Oct 17, 2012
Morris Jette authored
Previously the node count would change from c-node count to midplane count (but still be interpreted as a c-node count).
-
jette authored
No real changes to logic other than some additional error checking.
-
- Oct 16, 2012
Morris Jette authored
Preempt jobs only when insufficient idle resources exist to start the job, regardless of the node weight.
-
Danny Auble authored
-
- Oct 05, 2012
Morris Jette authored
Preemptor was not being scheduled. Fix for bugzilla #3.
-
Morris Jette authored
While this change lets gang scheduling happen, it overallocates resources from different priority partitions when gang scheduling is not running.
-
- Oct 04, 2012
Morris Jette authored
Preemptor was not being scheduled. See bugzilla #3 for details.
-
- Oct 02, 2012
Morris Jette authored
See bugzilla bug 132.

When using select/cons_res and CR_Core_Memory, hyperthreaded nodes may be overcommitted on memory when CPU counts are scaled. I've tested 2.4.2 and HEAD (2.5.0-pre3).

Conditions:
* SelectType=select/cons_res
* SelectTypeParameters=CR_Core_Memory
* Using threads - Ex. "NodeName=linux0 Sockets=1 CoresPerSocket=4 ThreadsPerCore=2 RealMemory=400"

Description:
In the cons_res plugin, _verify_node_state() in job_test.c checks if a node has sufficient memory for a job. However, the per-CPU memory limits appear to be scaled by the number of threads. This new value may exceed the available memory on the node. And, once a node is overcommitted on memory, future memory checks in _verify_node_state() will always succeed.

Scenario to reproduce:
With the example node linux0, we run a single-core job with 250MB/core:

    srun --mem-per-cpu=250 sleep 60

cons_res checks that it will fit: ((real - alloc) >= job mem), i.e. ((400 - 0) >= 250), and the job starts. Then the memory requirement is doubled:

    "slurmctld: error: cons_res: node linux0 memory is overallocated (500) for job X"
    "slurmd: scaling CPU count by factor of 2"

This job should not have started.

While the first job is still running, we submit a second, identical job:

    srun --mem-per-cpu=250 sleep 60

cons_res checks that it will fit: ((400 - 500) >= 250); the unsigned int wraps, the test passes, and the job starts. This second job also should not have started.
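The wraparound at the heart of this report can be reproduced standalone. Below is a minimal C demonstration using the report's numbers (not SLURM code; widening to a signed type before subtracting is one obvious remedy, not necessarily the fix that was adopted):

    #include <stdio.h>
    #include <stdint.h>

    int main(void)
    {
        uint32_t real_mem  = 400;   /* node RealMemory, MB */
        uint32_t alloc_mem = 500;   /* memory already (over)allocated, MB */
        uint32_t job_mem   = 250;   /* job's request, MB */

        /* Unsigned subtraction: 400 - 500 wraps to 4294967196,
         * so the "does it fit?" test wrongly passes. */
        if (real_mem - alloc_mem >= job_mem)
            printf("unsigned check: job admitted (wrong)\n");

        /* Widening to signed 64-bit gives -100 and the test
         * correctly fails. */
        if ((int64_t)real_mem - (int64_t)alloc_mem >= (int64_t)job_mem)
            printf("signed check: job admitted\n");
        else
            printf("signed check: job rejected (correct)\n");

        return 0;
    }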
-
Morris Jette authored
-
- Sep 27, 2012
Danny Auble authored
purged from the system if its front-end node goes down.
-
Danny Auble authored
database, and the job is running on a small block, make sure we free up the correct node count.
-
Bill Brophy authored
-
- Sep 25, 2012
Morris Jette authored
Based upon work by Jason Sollom, Cray Inc., and used by permission.
-
- Sep 24, 2012
Morris Jette authored
This addresses bug 130
-
- Sep 21, 2012
Danny Auble authored
with a job running or trying to run on it.
-
- Sep 20, 2012
Danny Auble authored
are planning on using the block. Previously it would fail those jobs erroneously.
-
- Sep 19, 2012
Danny Auble authored
-
- Sep 18, 2012
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
- Sep 17, 2012
Danny Auble authored
-
Danny Auble authored
or previous piecemeal method.
-
- Sep 15, 2012
Danny Auble authored
Adapted from a patch from Stephen Trofinoff <trofinoff@cscs.ch>
-