- Mar 16, 2012
-
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Danny Auble authored
to a bluegene cluster.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
already pinged it on startup the unresponding flag would be removed from the frontend node.
-
Danny Auble authored
-
Danny Auble authored
mark front end node down.
-
Danny Auble authored
-
- Mar 15, 2012
-
-
Danny Auble authored
state change while the realtime server is running.
-
Danny Auble authored
running on.
-
Danny Auble authored
-
Danny Auble authored
mark front end node down.
-
- Mar 14, 2012
-
-
Morris Jette authored
Cray - For srun wrapper when creating a job allocation, set the default job name to the executable file's name. Ignore leading directory names in the path.
-
Morris Jette authored
Cray - Enable logging of BASIL communications with environment variables. Set XML_LOG to enable logging. Set XML_LOG_LOC to specify path to log file or "SLURM" to write to SlurmctldLogFile or unset for "slurm_basil_xml.log". Based on work by Steve Tronfinoff, CSCS.
-
- Mar 13, 2012
-
-
Morris Jette authored
permit the srun and salloc commands to be executed in the background on Cray systems
-
Morris Jette authored
permit the srun and salloc commands to be executed in the background on Cray systems
-
Morris Jette authored
Add new job state reason of "FrontEndDown" which applies only to Cray and IBM BlueGene systems.
-
Danny Auble authored
-
Danny Auble authored
-
- Mar 12, 2012
-
-
Danny Auble authored
the queue when trying to place a larger than midplane job.
-
- Mar 09, 2012
-
-
Danny Auble authored
-
- Mar 07, 2012
-
-
Danny Auble authored
an admin updates the node to idle/resume the compute nodes will go instantly to idle instead of idle* which means no response.
-
- Mar 06, 2012
-
-
Danny Auble authored
gone. Previously it had a timelimit which has proven to not be the right thing.
-
Danny Auble authored
-
- Mar 02, 2012
-
-
Morris Jette authored
In cray/srun wrapper, only include aprun "-q" option when srun "--quiet" option is used.
-
- Feb 29, 2012
-
-
Morris Jette authored
-
- Feb 28, 2012
-
-
Morris Jette authored
-
- Feb 24, 2012
-
-
Morris Jette authored
Change default SchedulerParameters max_switch_wait field value from 60 to 300 seconds.
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Morris Jette authored
-
- Feb 23, 2012
-
-
Danny Auble authored
-
- Feb 20, 2012
-
-
jette authored
Patch from Aleksej Saushev.
-
- Feb 17, 2012
-
-
Danny Auble authored
CnodeCount/CnodeErrCount so to point out there are cnodes in an error state on the block. Draining the block and having it reboot when all jobs are gone will clear up the cnodes in Software Failure.
-
- Feb 16, 2012
-
-
Danny Auble authored
for a long time after the SLURM job has been flushed from the system we don't have to worry about rebooting the block to sync the system.
-
- Feb 11, 2012
-
-
Danny Auble authored
blocks.
-
- Feb 06, 2012
-
-
Danny Auble authored
are full allocation jobs, and others that are smaller.
-