- Dec 06, 2011
-
-
Yuri D'Elia authored
-
- Dec 05, 2011
-
-
Morris Jette authored
Patch by Alexander Bersenev (Institute of Mathematics and Mechanics, Russia).
-
- Dec 02, 2011
-
-
Danny Auble authored
There was also some bad code that would reset the conn_type of a block to SMALL no matter what type of SMALL it was.
-
- Dec 01, 2011
-
-
jette authored
This was due to a bug in select/cons_res with some configuration optiions and job options, especially if there is more than one thread per core and the job option includes "--threads-per-core=1". Fixes problem reported by CSCS.
-
- Nov 30, 2011
-
-
Danny Auble authored
lock was deemed not necessary because the information (db_index) was only internal and was only modified in the same function later which is protected by the write lock.
-
Danny Auble authored
qos on the cluster to fill that in correctly.
-
Danny Auble authored
incorrectly when getting fairshare from the parent.
-
- Nov 23, 2011
-
-
Danny Auble authored
wasn't started at the time the eligible message was sent but started before the db_index was returned information like start time would be lost.
-
- Nov 22, 2011
-
-
Morris Jette authored
-
Morris Jette authored
-
- Nov 18, 2011
-
-
jette authored
Patch from Yuri D'Ella
-
- Nov 08, 2011
-
-
Morris Jette authored
Note this is an old bug. The new code keeps slurmstepd alive and it keeps trying to send step completion message to slurmctld.
-
- Nov 07, 2011
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
This make the same patch to select/linear as Carles Fenoy's patch to select/cons_res plugin.
-
Carles Fenoy authored
Added gres_cpus test. Without this test it could lead to the error "fatal: cons_res: sync loop not progressing" With this patch a job will be rejected if asking for unavailable configuration.
-
- Nov 04, 2011
-
-
Morris Jette authored
Print an error rather than setting CUDA_VISIBLE_DEVICES environment variable to "NoDevFiles" if no device files defined.
-
Morris Jette authored
Patch 4f68cde5bd6b4fcf839f6694457373c81d9548ba from chaos/slurm by Don Lipari, LLNL
-
Morris Jette authored
The change in function call order of commit e60abe43 resulted in slurmd daemons on front-end systems not registering with the proper node name.
-
- Nov 02, 2011
-
-
Morris Jette authored
-
- Oct 31, 2011
-
-
Morris Jette authored
-
Morris Jette authored
-
- Oct 28, 2011
-
-
Morris Jette authored
Backfill scheduling - Add SchedulerParameters configuration parameter of "bf_res" to control the resolution in the backfill scheduler's data about when jobs begin and end. Default value is 60 seconds (used to be 1 second).
-
Morris Jette authored
-
Morris Jette authored
Do not drain the compute or front-end node when trying to start a job for which the UID is not found
-
Morris Jette authored
Release locks on cray system after inventory and before backfill scheduling loop in order to not process more jobs and avoid blocking pending RPCs for so long
-
Morris Jette authored
-
- Oct 27, 2011
-
-
Morris Jette authored
-
Morris Jette authored
This patch contains corrections for spelling errors in the code and improvements for some man pages. Patch from Gennaro Oliva.
-
- Oct 25, 2011
-
-
Morris Jette authored
Patch by Stephen Trofinoff, CSCS.
-
- Oct 24, 2011
-
-
Morris Jette authored
Change the logging of a job step create failure due to SlurmctldProlog running from info() to debug() since this can be due to a race condition.
-
Morris Jette authored
Do not attempt to run HeathCheckProgram on powered down nodes. Patch from Ramiro Alba, Centre Tecnològic de Tranferència de Calor, Spain.
-
- Oct 21, 2011
-
-
Danny Auble authored
-
- Oct 20, 2011
-
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
-
Danny Auble authored
-
- Oct 19, 2011
-
-
Morris Jette authored
Report correct job "Reason" if needed nodes are DOWN, DRAINED, or NOT_RESPONDING, "Resources" rather than "PartitionNodeLimit".
-
Danny Auble authored
plugins in the slurmd.
-
Danny Auble authored
-