- May 27, 2016
-
-
Morris Jette authored
-
Morris Jette authored
This bug was introduced by commit 21c52d2f, which fixed a different problem tracking resources associated with suspended jobs. There are subtle differences between jobs suspended by a user/administrator and jobs suspended by gang scheduling, which resulted in undercounting allocated CPUs when a job suspended by gang scheduling was active at the same time as a slurmctld reconfiguration request. See bugs 2353 (original bug related to commit 21c52d2f) and bug 2765
-
Danny Auble authored
accounts) no default account is printed; previously NULL was printed. This change only suppresses the printing, but the whole function should probably be revisited, as the rigmarole can probably be avoided: we always know what the default is going to be if none is specified (the first off the list). The problem with that, though, is a user who has already been added to a cluster and has a default there, but is then added to a new cluster where they don't have a default. In this case you want to keep the first cluster's default, but set the default for the second cluster. Bug 2725
-
Danny Auble authored
-
Tim Wickberg authored
Add missing unlock before return. Coverity 44888.
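The class of defect behind this fix can be illustrated with a minimal sketch. It is not the Slurm code touched by the commit: table_lock, table_get() and table_lock_is_free() are hypothetical names, and the only point is that every exit path, including the early error return, must release the mutex.

```c
#include <pthread.h>
#include <stddef.h>

/* Hypothetical shared table; the names are illustrative, not Slurm's. */
static pthread_mutex_t table_lock = PTHREAD_MUTEX_INITIALIZER;
static int table[4] = { 10, 20, 30, 40 };

/*
 * Copy table[idx] into *out. Returns 0 on success, -1 on a bad index.
 * Every exit path releases table_lock, including the early error return.
 */
int table_get(size_t idx, int *out)
{
	pthread_mutex_lock(&table_lock);
	if (idx >= sizeof(table) / sizeof(table[0])) {
		/* The easy-to-miss unlock on the error path. */
		pthread_mutex_unlock(&table_lock);
		return -1;
	}
	*out = table[idx];
	pthread_mutex_unlock(&table_lock);
	return 0;
}

/* Returns 1 if table_lock is currently free, 0 if it is held. */
int table_lock_is_free(void)
{
	if (pthread_mutex_trylock(&table_lock) == 0) {
		pthread_mutex_unlock(&table_lock);
		return 1;
	}
	return 0;
}
```

Without the unlock inside the error branch, the first out-of-range lookup would leave table_lock held and the next caller would block forever.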
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
srun was modified to log "Warning: can't honor --ntasks-per-node set to 8 which doesn't match the requested tasks 23 with the number of requested nodes 3. Ignoring --ntasks-per-node." but this test was parsing that warning, causing a test failure. Modify the test's parsing logic.
-
Morris Jette authored
This reverts commit cc242de3. That patch fixed bug 2745, but breaks tests 1.89 and 1.91 on typical Xeon processors.
-
Morris Jette authored
-
Morris Jette authored
This bug was introduced by commit 21c52d2f, which fixed a different problem tracking resources associated with suspended jobs. There are subtle differences between jobs suspended by a user/administrator and jobs suspended by gang scheduling, which resulted in undercounting allocated CPUs when a job suspended by gang scheduling was active at the same time as a slurmctld reconfiguration request. See bugs 2353 (original bug related to commit 21c52d2f) and bug 2765
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
accounts) no default account is printed; previously NULL was printed. This change only suppresses the printing, but the whole function should probably be revisited, as the rigmarole can probably be avoided: we always know what the default is going to be if none is specified (the first off the list). The problem with that, though, is a user who has already been added to a cluster and has a default there, but is then added to a new cluster where they don't have a default. In this case you want to keep the first cluster's default, but set the default for the second cluster. Bug 2725
-
Danny Auble authored
-
- May 26, 2016
-
-
Danny Auble authored
This also makes use of the -j (join) flag as a normal qsub does.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Fix for uninitialized variable in task binding logic that could leave tasks with fewer CPUs than intended. bug 2766
-
- May 25, 2016
-
-
Morris Jette authored
-
Morris Jette authored
Fix GRES task layout bug that could cause slurmctld to abort with invalid memory reference. bug 2766
-
Tim Wickberg authored
Add missing unlock before return. Coverity 44888.
-
Tim Wickberg authored
-
Tim Wickberg authored
Coverity 44891.
-
- May 24, 2016
-
-
Tim Wickberg authored
-
Tim Wickberg authored
sizeof(optarg) is incorrect: that is the size of the pointer, not the length of the character string that must be parsed. Coverity 53128.
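The distinction can be shown with a small sketch. The two helper functions below are hypothetical and exist only for illustration; they are not taken from the patched source. Applied to a char pointer such as optarg, sizeof yields the pointer size regardless of what the string contains, while strlen() counts the characters up to the terminating NUL.

```c
#include <stddef.h>
#include <string.h>

/* What the buggy pattern measures: the size of a char pointer. */
size_t pointer_size(const char *arg)
{
	return sizeof(arg);	/* sizeof(char *), independent of the string */
}

/* What parsing needs: the number of characters before the NUL. */
size_t string_length(const char *arg)
{
	return strlen(arg);
}
```

On a typical 64-bit build pointer_size() returns 8 for every argument, so a loop or copy bounded by it would silently stop after 8 bytes or run past the end of a shorter string.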
-
Tim Wickberg authored
Coverity 44992.
-
Artem Polyakov authored
bug 2760
-
Morris Jette authored
-
Tim Wickberg authored
Needs to unlock here, not re-lock the lock.
-
Morris Jette authored
Some tests fail if the partition is configured with Shared=exclusive. This change disables the same tests as if select/linear is configured.
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Tim Wickberg authored
Prevent '--preserve' from being inadvertently enabled by '-j'.
-
Tim Wickberg authored
-
Tim Wickberg authored
-
Morris Jette authored
Logic introduced in v16.05.0-rc2 could attempt a state change for nid00000, even if no such node exists. bug 2758
-
- May 23, 2016
-
-
Nicolas Joly authored
Still testing 16.05 on my NetBSD/amd64 workstation ... Just encountered a crash with scancel(1).

njoly@lanfeust [~]> sbatch --wrap "sleep 3600"
Submitted batch job 4680
njoly@lanfeust [~]> scancel 4680
scancel: Error detected by libpthread: Invalid condition variable.
Detected by file "/local/src/NetBSD/src/lib/libpthread/pthread_cond.c", line 140, function "pthread_cond_timedwait".
See pthread(3) for information.
zsh: abort (core dumped) scancel 4680

Checking the code shows indeed that the pthread_cond_wait() call from scancel.c:_signal_job_by_str() uses an uninitialised condition variable, "num_active_threads_cond". The attached patch, which adds the missing pthread_cond_init(), seems to fix it. bug 2753
-