Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
S
Slurm
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
tud-zih-energy
Slurm
Commits
939a03c0
Commit
939a03c0
authored
21 years ago
by
Moe Jette
Browse files
Options
Downloads
Patches
Plain Diff
Record updates for 0.2.17 release.
parent
38c62432
No related branches found
Branches containing commit
No related tags found
No related merge requests found
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
NEWS
+20
-0
20 additions, 0 deletions
NEWS
with
20 additions
and
0 deletions
NEWS
+
20
−
0
View file @
939a03c0
...
...
@@ -4,12 +4,32 @@ documents those changes that are of interest to users and admins.
* Changes in SLURM 0.2.17
=========================
-- Fixes for reported problems:
- slurm/279: Hold jobs that can't execute due to DOWN or DRAINED
nodes and release when nodes are returned to service.
- slurm/285: "srun killed due to SIGPIPE"
-- Support for running job steps on nodes relative to current
allocation via srun -r, --relative=n option.
-- SIGKILL no longer broadcasted to job via srun on task failure unless
--no-allocate option is used.
-- Re-enabled "chkconfig --add" in default RPMs.
-- Backup controller setting proper PID into slurmctld.pid file.
-- Backup controller restores QSW state each time it assumes control
-- Backup controller purges old job records before assuming control
to avoid resurrecting defunct jobs.
-- Kill jobs on non-responding DRAINING nodes and make their state
DRAINED.
-- Save state upon completion of a job's last EPILOG_COMPLETION to
reduce possibility of inconsistent job and node records when the
controller is transitioning between primary and backup.
-- Change logging level of detailed communication errors to not print
them unless detailed debugging is requested.
-- Increase number of concurrent controller server threads from 20
to 50 and restructure code to handle backlogs more efficiently.
-- Partition state at controller startup is based upon slurm.conf
rather than previously saved state. Additional improvements to
avoid inconsistent job/node/partition states at restart. Job state
information is used to arbitrate conflicts.
-- Orphaned file descriptors eliminated.
* Changes in SLURM 0.2.16
=========================
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment