- Nov 22, 2013
  - David Bigagli authored
  - David Bigagli authored
- Nov 20, 2013
  - David Bigagli authored
    entire parallel job.
- Nov 18, 2013
  - Morris Jette authored
    Conflicts: doc/html/faq.shtml
  - Morris Jette authored
    The time/resource allocation matrix is rebuilt on each job exit, which severely impacts performance at large counts of running jobs (say >10k jobs).
  - Morris Jette authored
  - Michel Hummel authored
    Logic in the backfill scheduler was not updated when array_task_id was widened from 16 to 32 bits.
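As an illustration of the kind of overflow the backfill fix guards against (a hypothetical sketch, not Slurm code): a job array task ID stored in a 16-bit field is silently truncated once IDs exceed 65535.

```shell
# Hypothetical illustration: masking to 16 bits, as an undersized
# field would, silently corrupts large array task IDs.
task_id=70000                      # valid once array_task_id is 32-bit
truncated=$(( task_id & 0xFFFF ))  # what a 16-bit field would retain
echo "$task_id -> $truncated"      # prints: 70000 -> 4464
```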
- Nov 16, 2013
  - Phil Eckert authored
  - Chrysovalantis Paschoulas authored
- Nov 15, 2013
  - Rod Schultz authored
    limits are configured as 0.
  - Morris Jette authored
    bug 511
  - Morris Jette authored
  - Morris Jette authored
    Add ability to clear a node's DRAIN flag using scontrol or sview by setting its state to "UNDRAIN". The node's base state (e.g. "DOWN" or "IDLE") will not be changed. bug 514
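The new UNDRAIN state is applied like any other scontrol node state update; a usage sketch (the node name is hypothetical, and this of course requires a running slurmctld):

```shell
# Drop the DRAIN flag from node "tux3" without touching its base state:
# an IDLE node stays IDLE, a DOWN node stays DOWN.
scontrol update NodeName=tux3 State=UNDRAIN
```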
- Nov 14, 2013
  - Danny Auble authored
  - Danny Auble authored
  - Morris Jette authored
    bug 511
  - Morris Jette authored
  - Danny Auble authored
  - Danny Auble authored
  - Danny Auble authored
  - Morris Jette authored
  - Danny Auble authored
  - Morris Jette authored
- Nov 13, 2013
  - Morris Jette authored
  - Danny Auble authored
    modular.
  - Morris Jette authored
  - Morris Jette authored
    This makes it simpler to enable detailed debugging for reservations. It includes more information than we probably want to see with DebugFlags=Reservation and would be only for developer debugging.
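For reference, the reservation debug flag mentioned above can be set either in slurm.conf or at run time; a sketch, assuming the standard DebugFlags mechanism (this requires a running slurmctld):

```shell
# slurm.conf: enable reservation debug messages at daemon start
#   DebugFlags=Reservation
# or toggle the flag on a live controller:
scontrol setdebugflags +Reservation
```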
  - Morris Jette authored
    This might have worked fine for core reservations or when there are sufficient idle nodes to use, but the select_g_resv_test() function clears the node bitmap for nodes that it cannot use, and the reservation create logic did not restore that bitmap after a failed resource selection attempt. This change restores the node bitmap on a failed call to select_g_resv_test() so we can add nodes to the bitmap of available nodes rather than having it repeatedly cleared. The logic also adds some performance enhancements that I will add to in the next commit.
  - Morris Jette authored
  - jette authored
  - jette authored
  - David Bigagli authored
  - Morris Jette authored
    This fixes a bug where a system is enforcing memory limits and a job that already has a step running on some of the nodes tries to start another step using some of those nodes. For example, with DefMemPerNode configured and the select plugin enforcing memory limits, try:
      salloc -N2 bash
      $ srun -N1 sleep 10 &
      $ srun -N2 hostname
    Without this patch, the second srun would fail instead of pending.
- Nov 12, 2013
  - Danny Auble authored
    on a task level if any task hit it the check will be triggered)
  - David Bigagli authored
  - Danny Auble authored
  - David Bigagli authored
  - Troy Baer authored
  - David Bigagli authored
  - Danny Auble authored