- Oct 20, 2015
-
-
Morris Jette authored
If a suspended job is manually resumed and gang scheduling is configured, but no time slices are available for the job being resumed, then just resume it without adding it to a time slice. The jobs previously running on those nodes will be replaced with new jobs as resources become available and the resumed job will basically be treated like a stray job. bug 2031
-
Morris Jette authored
Avoid reporting more allocated CPUs than exist on a node. This can be triggered by resuming a previously suspended job, resulting in oversubscription of CPUs. bug 2021
-
Morris Jette authored
bug 2031
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
Add scancel -f/--full option to signal all steps including batch script and all of its child processes. bug 2031
-
Morris Jette authored
-
Morris Jette authored
-
- Oct 19, 2015
-
-
Brian Christiansen authored
Bug 1888
-
Danny Auble authored
out. Remove unneeded code that commit 8274ea54 fixed. This code would zero out all GRES/TRES on a reconfig, which isn't what we want. 8274ea54 does the right thing by itself.
-
Hongjia Cao authored
bug 2032
-
Deric Sullivan authored
bug 2039
-
Deric Sullivan authored
bug 2037
-
Morris Jette authored
-
Deric Sullivan authored
backport of commit 4f2e2801 from v16.05
-
Morris Jette authored
Needed to change a couple of variables from 32- to 64-bit.
-
Morris Jette authored
Add new burst_buffer.conf parameters: ValidateTimeout and OtherTimeout. See man page for details.
-
Morris Jette authored
-
- Oct 16, 2015
-
-
David Bigagli authored
-
Josko Plazonic authored
-
- Oct 15, 2015
-
-
Danny Auble authored
previously take 2 restarts of the slurmdbd to make it stick correctly.
-
- Oct 14, 2015
-
-
Danny Auble authored
single-threaded cores. A regression caused only 1 socket to be used on this kind of node instead of all that were available.
-
- Oct 09, 2015
-
-
Morris Jette authored
-
Morris Jette authored
If a job allocation returns some invalid contents, the pointer to the job structure may be NULL. This change preserves the error message and avoids a segv.
-
- Oct 08, 2015
-
-
Brian Christiansen authored
Fix case where the backup slurmdbd would be killed if it had existing connections when it gave up control. In that case, it would try to signal the existing threads by using pthread_kill to send SIGKILL to them. The problem is that SIGKILL doesn't go to the thread but to the whole process, so the backup dbd would be killed.
-
Danny Auble authored
when a cold-start (-c) happens to the slurmctld.
-
- Oct 07, 2015
-
-
Danny Auble authored
Conflicts: src/sacct/options.c
-
Danny Auble authored
-
Danny Auble authored
from a user. This would cause the slurmctld to cache the old default, which wasn't valid, and force the user to always request the association.
-
Danny Auble authored
Conflicts: NEWS src/plugins/accounting_storage/mysql/as_mysql_job.c
-
Morris Jette authored
bug 2009
-
Morris Jette authored
Each node could have fewer tasks allocated than the plane size, which broke the test. The plane size needs to be treated as a maximum consecutive rank value.
-
Thomas Cadeau authored
-
Morris Jette authored
-
Morris Jette authored
bug 2013
-
David Bigagli authored
-
Hongjia Cao authored
-
David Bigagli authored
-
Hongjia Cao authored
-
Danny Auble authored
database but the start record hadn't made it yet.
-