- Aug 23, 2016
-
-
David Gloe authored
The attached patch switches to a more reliable method of detecting service nodes, using xtcli status. In addition, it switches to the print function to be better compatible with python 3.
-
- Aug 22, 2016
-
-
Boris Karasev authored
-
Danny Auble authored
-
Danny Auble authored
multiple versions.
-
Boris Karasev authored
on the most recent pmix_vX.so file. One is free to directly use new "pmix_vX" plugin names as well. NOTE: there is one small artifact introduced by this fix - there will be no "pmix" name in the output of "srun --mpi=list" but the most recent plugin will be duplicated: $ ./srun --mpi=list srun: MPI types are... srun: mpi/none srun: mpi/mvapich srun: mpi/mpich1_shmem srun: mpi/mpichgm srun: mpi/pmix_v1 <---- srun: mpi/mpichmx srun: mpi/lam srun: mpi/pmi2 srun: mpi/openmpi srun: mpi/mpich1_p4 srun: mpi/pmix_v1 <----
-
Danny Auble authored
conflicting pmix versions.
-
Danny Auble authored
-
Danny Auble authored
-
Boris Karasev authored
-
Danny Auble authored
linking against.
-
Danny Auble authored
commit 20a73c508e1.
-
Danny Auble authored
of what is in the header file of pmix.
-
Danny Auble authored
-
Boris Karasev authored
To ease the distribution process, plugin names will be automatically adjusted to identify the version of API that it can support, ie: pmix_v1 and pmix_v2. This provides the ability for distro's to create separate non-conflicting packages for each API generation. Bug 2986
-
- Aug 20, 2016
-
-
Morris Jette authored
Insure reported expected job start time is not in the past for pending jobs. bug 3002
-
- Aug 19, 2016
-
-
Tim Wickberg authored
-
Morris Jette authored
burst_buffer/cray: Requeue, but do not hold a job which fails the pre_run operation. bug 3009
-
- Aug 18, 2016
-
-
Alejandro Sanchez authored
Bug 2984.
-
- Aug 17, 2016
-
-
Morris Jette authored
-
- Aug 16, 2016
-
-
Alejandro Sanchez authored
Only mark job_id as zero for batch step (when all job steps would be cleared), not for individual steps which prevented successive steps from being cancelled. Bug 2984.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
slurmstepd modified to pre-load all relevant plugins at startup to avoid the possibility of modified plugins later resulting in inconsistent API or data structures and a failure of slurmstepd. bug 2334
-
- Aug 15, 2016
-
-
Danny Auble authored
-
- Aug 12, 2016
-
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
-
- Aug 11, 2016
-
-
Morris Jette authored
bug 2655
-
Morris Jette authored
Add more details to the log messages for improved ability to diagnose problems.
-
Tim Wickberg authored
Bug 2983.
-
Tim Wickberg authored
Continuation of 18947480. Commit cff2effc made cpu_freq_init unconditionally set up the structures, so this reinitialization was discarding the true original info, which prevents the frequency and governor from being restored properly after the step completes.
-
Morris Jette authored
Don't about step launch if job reaches expected end time while node is configuring/booting (NOTE: The job end time will be adjusted after node becomes ready for use). bug 2985
-
Morris Jette authored
The sleep starts at 0.1 second and doubles each time to a maximum value. Previous maximum was 2 seconds, increased to 5 seconds to account for very large KNL boot times (tens of minutes) and to limit overhead on slurmctld daemon.
-
- Aug 10, 2016
-
-
Morris Jette authored
If the KNL node modes reported by cnselect and capmc differ, log that using "info" rather than "debug".
-
Danny Auble authored
frequency other than AcctGatherNodeFreq.
-
Danny Auble authored
Filesystem instead of Lustre.
-
Iakovos Panourgias authored
Network dataset.
-
Morris Jette authored
Locking slurmstepd in memory can result in exhausting real memory in some cases, resulting in failure of the slurmstepd process. This reverts commit 03cf4a5d, but the logic will be returned using a configuration parameter in Slurm version 17.02. bug 2334
-
Morris Jette authored
This should improve performance and prevent failure if a local group ID lookup fails. bug 2928
-