- Sep 05, 2008
- Sep 03, 2008
-
-
- Sep 02, 2008
- Aug 29, 2008
-
-
Moe Jette authored
https://computing.llnl.gov/linux/slurm/power_save.html or "man slurm.conf" (SuspendProgram and related parameters) for more information. This is the final installment of the work: update some documentation, increase the default ResumeRate, and reduce the frequency of retrying the batch launch state check.
-
Moe Jette authored
-
Moe Jette authored
that it doesn't get set DOWN right away for not responding since the last time it was powered up.
-
Moe Jette authored
Don't allocate a node to a job step until it is responding (as before) AND it is no longer in power save mode.
-
Moe Jette authored
"man slurm.conf" (SuspendProgram and related parameters) for more information. NOTE: the step create logic needs to be modified to return EAGAIN or the like until the node's NOT_RESPONDING state flag is cleared.
-
- Aug 27, 2008
-
-
- Aug 18, 2008
-
-
Moe Jette authored
longer needed.
-
-
- Aug 14, 2008
-
-
-
Moe Jette authored
-
- Aug 12, 2008
- Aug 11, 2008
-
-
Danny Auble authored
-
Danny Auble authored
-
Moe Jette authored
ping the node immediately to clear the NOT_RESPONDING flag.
-
Joseph P. Donaghy authored
-
Danny Auble authored
-
Joseph P. Donaghy authored
-
- Aug 09, 2008
-
-
Moe Jette authored
-
- Aug 08, 2008
-
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
expression rather than one line per node. The frequency of log messages depends on the SlurmctldDebug value, ranging from one every 300 seconds at SlurmctldDebug<=3 to one every second at SlurmctldDebug>=5.
-
Moe Jette authored
-
Moe Jette authored
-
Danny Auble authored
-
Danny Auble authored
-
- Aug 07, 2008
-
-
Danny Auble authored
-
Danny Auble authored
-
Joseph P. Donaghy authored
-
Danny Auble authored
-
Danny Auble authored
-