- Dec 28, 2015
-
-
Morris Jette authored
Preemption/gang scheduling: If a job is suspended at slurmctld restart or reconfiguration time, then leave it suspended rather than resume+suspend. bug 2274
-
Morris Jette authored
bug 2276
-
- Dec 24, 2015
-
-
Morris Jette authored
Add cray-specific port configuration information which considers Realm-Specific IP Addressing (RSIP). bug 2277
-
- Dec 23, 2015
-
-
Morris Jette authored
Filter out some message commonly generated in high-throughput computing environments. bug 2282
-
Tim Wickberg authored
-
Morris Jette authored
-
Morris Jette authored
task/affinity: Disable core-level task binding if more CPUs required than available cores. bug 2267
-
Morris Jette authored
Improve the logging of job submit RPC contents; add descriptor/name to some of the fields printed rather than just raw numbers.
-
Morris Jette authored
Log as error if more than 3 aeld connects per second that cause is likely duplicate slurmctld daemon bug 2278
-
- Dec 22, 2015
-
-
Morris Jette authored
This is needed to properly enforce limits and account for usage.
-
- Dec 21, 2015
-
-
Tim Wickberg authored
-
Morris Jette authored
-
Brian Christiansen authored
-
Morris Jette authored
-
- Dec 19, 2015
-
-
John Hensley authored
Remove the 1024-character limit on lines in batch scripts, which was causing long lines to be silently truncated. I noticed it when jobs were getting created with fewer dependencies than specified. Also increase the line length when showing job info.
-
jette authored
No change in logic
-
- Dec 18, 2015
-
-
jette authored
sched/backfill: If a job can not be started within the configured backfill_window, set it's start time to 0 (unknown) rather than the end of the backfill_window. bug 2100
-
Danny Auble authored
-
Danny Auble authored
-
jette authored
If a pending job array has multiple reasons for being in a pending state, then print all reasons in a comma separated list. Before: JOBID PARTITION NAME USER ST TIME NODES NODELIST(REASON) 97354_[1-4] debug tmp jette PD 0:00 1 (Resources) After: JOBID PARTITION NAME USER ST TIME NODES NODELIST(REASON) 97354_[1-4] debug tmp jette PD 0:00 1 (Resources,JobHeldUser)
-
- Dec 17, 2015
-
-
Alejandro Sanchez authored
-
David Bigagli authored
-
Morris Jette authored
The tests were sometimes failing unless I gathered accounting data at more frequent intervals. This seems to work with default accounting collection intervals.
-
Morris Jette authored
When extra pools were reported by Cray API, it was not multiplying the buffer count by granularity to get the total size, but was just getting the counts.
-
Danny Auble authored
-
- Dec 16, 2015
-
-
Alejandro Sanchez authored
job's TimeLimit if the job did't request --time
-
David Matthews authored
-
Brian Christiansen authored
Bug 2130
-
Morris Jette authored
Move slurmctld mail handler to separate thread for improved performance. Original logic did fork/exec without separate thread and if the slurmctld memory size is huge, then the time required for fork() to complete can be significant. bug 2252
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Avoid hard-link/copy of script/environment files for job arrays. Use the master job record file for all tasks of the job array. back port of commit 41eead64 bug 2220
-
Morris Jette authored
Refactor logic to synchronize active bach jobs and their script/environment files, reducing overhead dramatically for large numbers of active jobs. backport of commit 41eead64 bug 2220
-
- Dec 15, 2015
-
-
Morris Jette authored
-
Morris Jette authored
If a job can never be started due to unsatisfied job dependencies, report the full original job dependency specification rather than the dependencies remaining to be satisfied (typically NULL). bug 2197
-
Danny Auble authored
in use.
-
Danny Auble authored
Conflicts: src/slurmctld/proc_req.c
-
Danny Auble authored
_slurm_rpc_complete_job_allocation. This is a rewrite of 438365ec which didn't catch the job_ptr wasn't in a lock so the memory issue could still of existed. This hopefully fixes all the spots the job_ptr wasn't in the lock. Fixes bug 2146
-
Danny Auble authored
This reverts commit 438365ec.
-
Brian Christiansen authored
Bug 2261
-