- Mar 10, 2016
-
-
Tim Wickberg authored
Was incorrectly displaying "(null)" even when loaded successfully.
-
Morris Jette authored
possible bug in smap Makefile
-
Manuel Rodríguez-Pascual authored
LIBS can have a previous value, as depicted in ./configure --help "Some influential environment variables: (...) LIBS libraries to pass to the linker, e.g. -l<library> " Original assignation to LIBS overwrites this value. With this edition, the user defined flags and NCURSES ones are both employed by the linker.
-
- Mar 09, 2016
-
-
Morris Jette authored
This matches the documentation
-
- Mar 08, 2016
-
-
Morris Jette authored
Capture MCDRAM percentages for various configurations from capmc. This assumes the percentages for various configurations will be identical for all nodes within a cluster.
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Janne Blomqvist authored
-
- Mar 07, 2016
-
-
Morris Jette authored
-
Brian Christiansen authored
clang found a deferencing null issue which lead to finding the parsing error.
-
Dominik Bartkiewicz authored
Added new job dependency type of "aftercorr" which will start a task of a job array after the corresponding task of another job array completes. bug 2460
-
Tim Wickberg authored
In particular, it seems that MariaDB has changed the default for innodb_lock_wait_timeout has been lowered which can cause issues for the various rollup processes on systems with high job counts.
-
- Mar 05, 2016
-
-
Morris Jette authored
Fix some timing issues with respect to rebooting a node, especailly KNL node needing reboot to change configuration.
-
Danny Auble authored
would only track gres/gpu, now it will track both gres/gpu and gres/gpu:tesla as separate gres if configured like AccountingStorageTRES=gres/gpu,gres/gpu:tesla
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
--gres=gpu:tesla before you needed to give a count --gres=gpu:tesla:1 now both should work.
-
- Mar 04, 2016
-
-
Danny Auble authored
-
Danny Auble authored
Step GRES value changed from type "int" to "int64_t" to support larger values. Signed-off-by:
Danny Auble <da@schedmd.com>
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
These changes apply to both the main scheduling logic and backfill scheduler. If some SchedulerParameters value was configured, the slurmctld started, then completely removed, and slurmctld reconfigured the value would not be reset to it's default value but the originally configured value would persist until slurmctld restarted.
-
Brian Christiansen authored
Continuation of 31225a82
-
Morris Jette authored
Harden code to not fail if node_bitmap passed to _update_node_gres() has no bits set.
-
Brian Christiansen authored
-
Brian Christiansen authored
Bug 2430
-
- Mar 03, 2016
-
-
Morris Jette authored
This may be helpful for timing purposes. Added by Cray request.
-
Morris Jette authored
Unless a job is running in --multi-prog mode, modify the logic to resolve the job's path once rather than once for each task. This may slightly improve performance (requested by Cray).
-
Danny Auble authored
of it's very close version.
-
Thomas Hamel authored
We want to introduce a new behavior in the way slurmd uses the HealthCheckProgram. The idea is to avoid a race condition between the first HealthCheckProgram run and the node accepting jobs. The slurmd daemon will initialize and then loop on HealthCheckProgram execution before registering with slurmctld. It will stay in this loop until the HealthCheckProgram returns successfully (the node is still DOWN). On our clusters we are using NHC as an HealthCheckProgram. NHC drains the node if it fails and remove the drain if it is successfull, this behavior fits well with our purpose. This behavior permits us to start slurmd at boot without setting up a complex boot sequence in the init system, slurmd just wait for the node to be ready before registering. The HealthCheckProgram is not run during slurmd startup if HealthCheckInteval is 0.
-
Danny Auble authored
-
Danny Auble authored
-
Brian Christiansen authored
Bug 2507
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Step GRES value changed from type "int" to "int64_t" to support larger values. Previous logic could fail in step allocation values over 32-bits. Other GRES values are 64-bit.
-
Tim Wickberg authored
-