- Jun 11, 2014
-
-
Morris Jette authored
-
Morris Jette authored
Remove duplicate backfill scheduling tests. For example there is no need to test if a job can be started if the only difference from the previous test involves nodes in other partitions that can not be used by the job we are trying to start.
-
- Jun 10, 2014
-
-
Morris Jette authored
The backfill scheduler was always reporting the time that a job was being considered as NOW rather than the time that was really being considered.
-
David Bigagli authored
decreases and total is less than in use.
-
Danny Auble authored
-
jette authored
-
Morris Jette authored
-
Morris Jette authored
Improve how failures in slurmd/slurmstepd communications are logged.
-
- Jun 09, 2014
-
-
Morris Jette authored
mail messages for job array events print now use the job ID using the format "#_# (#)" rather than just the internal job ID.
-
Marlys Kohnke authored
Cray network.
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
Cray/ALPS system - Enable backup controller to run outside of the Cray to accept new job submissions and most other operations on the pending jobs.
-
David Gloe authored
* Handle a missing ALPS spool directory * Update debug statements to work like the switch plugin * Move some code into static functions * Keep track of the number of active job steps and call alpsc_node_app_epilogue
-
Morris Jette authored
Add child_forked() function to the slurm_acct_gather_profile plugin to close open files, leaving application with no extra open file descriptors.
-
David Bigagli authored
-
Morris Jette authored
Conflicts: src/slurmctld/job_mgr.c
-
Morris Jette authored
This will help limit damage from two active primary slurmctld (split brain problem).
-
- Jun 07, 2014
-
-
Morris Jette authored
-
David Bigagli authored
it is already running.
-
Morris Jette authored
-
Morris Jette authored
Conflicts: testsuite/expect/test7.9
-
Morris Jette authored
Duplicate triggers are not not allowed
-
Morris Jette authored
Job profiling leaves a file open
-
David Bigagli authored
job is JOB_COMPLETING or already pending.
-
- Jun 06, 2014
-
-
David Bigagli authored
last epilog completes, either slurmd epilog or slurmctld epilog, whichever comes last.
-
Morris Jette authored
Describe cgroup-based core specialization support.
-
Morris Jette authored
-
Martin Perry authored
-
David Bigagli authored
don't clear the dependency if the job is completing.
-
- Jun 05, 2014
-
-
Danny Auble authored
(Also remove extra pending check, no reason to check it twice ;))
-
Morris Jette authored
Conflicts: src/slurmctld/job_mgr.c
-
Morris Jette authored
If the backup slurmctld assumes primary status, then do NOT purge any job state files (batch script and environment files) but if any attempt is made to re-use them consider this a fatal error. It may indicate that multiple primary slurmctld daemons are active (e.g. both backup and primary are functioning as primary and there is a split brain problem).
-
Danny Auble authored
-
Morris Jette authored
Replace printing of job_id using %d with %u
-
Danny Auble authored
-
Morris Jette authored
Conflicts: src/slurmctld/slurmctld.h
-
Morris Jette authored
Test time when job_state file was written to detect multiple primary slurmctld daemons (e.g. both backup and primary are functioning as primary and there is a split brain problem).
-
Danny Auble authored
-