    This file describes changes in recent versions of Slurm. It primarily
    documents those changes that are of interest to users and administrators.
    
    * Changes in Slurm 14.11.9
    ==========================
     -- Correct "sdiag" backfill cycle time calculation if it yields locks. A
        microsecond value was being treated as a second value resulting in an
        overflow in the calculation.
     -- Fix segfault when updating timelimit on job array task.
     -- Fix to job array update logic that can result in a task ID of 4294967294.
     -- Fix of job array update, previous logic could fail to update some tasks
        of a job array for some fields.
     -- CRAY - Fix seg fault if a blade is replaced and slurmctld is restarted.
     -- Fix plane distribution to allocate in blocks rather than cyclically.
     -- squeue - Remove newline from job array ID value printed.
     -- squeue - Enable filtering for job state SPECIAL_EXIT.
     -- Prevent job array task ID being inappropriately set to NO_VAL.
     -- MYSQL - Make it so you don't have to restart the slurmctld
        to gain the correct limit when a parent account is root and you
        remove a subaccount's limit which exists on the parent account.
     -- MYSQL - Close chance of setting the wrong limit on an association
        when removing a limit from an association on multiple clusters
        at the same time.
     -- MYSQL - Fix minor memory leak when modifying an association but no
        change was made.
     -- srun command line of either --mem or --mem-per-cpu will override both the
        SLURM_MEM_PER_CPU and SLURM_MEM_PER_NODE environment variables.
     -- Prevent slurmctld abort on update of advanced reservation that contains no
        nodes.
     -- ALPS - Revert commit 2c95e2d22, which also removes commit 2e2de6a4,
        allowing Cray with the SubAllocate option to work as it did with 2.5.
     -- Properly parse CPU frequency data on POWER systems.
     -- Correct sacct.a man pages describing -i option.
     -- Capture salloc/srun information in sdiag statistics.
     -- Fix bug in node selection with topology optimization.
     -- Don't set distribution when srun requests 0 memory.
     -- Read in correct number of nodes from SLURM_HOSTFILE when specifying nodes
        and --distribution=arbitrary.
     -- Fix segfault in Bluegene setups where RebootQOSList is defined in
        bluegene.conf and accounting is not set up.
     -- MYSQL - Update mod_time when updating a start job record or adding one.
     -- MYSQL - Fix issue where, if an association id ever changes while at
        least a portion of a job array is still pending after its initial
        start in the database, another row could be created for the remaining
        array instead of using the already existing row.
     -- Fix scheduling anomaly with job arrays submitted to multiple partitions,
        jobs could be started out of priority order.
     -- If a host has suspended jobs do not reboot it. Reboot only hosts
        with no jobs in any state.
     -- ALPS - Fix issue where using the --exclusive flag on srun resulted in
        -F share instead of the correct -F exclusive.
     -- Fix various memory leaks in the Perl API.
     -- Fix a bug in the controller which displayed jobs in CF state as RUNNING.
     -- Preserve advanced _core_ reservation when nodes added/removed/resized on
        slurmctld restart. Rebuild core_bitmap as needed.
     -- Fix for non-standard Munge port location for srun/pmi use.
     -- Fix gang scheduling/preemption issue that could cancel job at startup.
     -- Fix a bug in squeue which prevented squeue -tPD from printing array jobs.
     -- Sort job arrays in job queue according to array_task_id when priorities are
        equal.
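
The tie-breaking rule in the last entry above (order by priority, then by array_task_id when priorities are equal) can be illustrated with a short sketch. This is not Slurm's scheduler code. The QueuedTask type, its field names, and the sort_queue helper are hypothetical, and the sketch assumes that a larger priority value sorts first and that task IDs sort in ascending order.

```python
from dataclasses import dataclass


@dataclass
class QueuedTask:
    # Hypothetical stand-in for a queued job array task; not a Slurm structure.
    job_id: int
    priority: int
    array_task_id: int


def sort_queue(tasks):
    # Assumption: higher priority runs first. Ties on priority are broken
    # by ascending array_task_id, as the changelog entry describes.
    return sorted(tasks, key=lambda t: (-t.priority, t.array_task_id))


queue = [
    QueuedTask(job_id=100, priority=500, array_task_id=3),
    QueuedTask(job_id=100, priority=500, array_task_id=1),
    QueuedTask(job_id=101, priority=900, array_task_id=0),
    QueuedTask(job_id=100, priority=500, array_task_id=2),
]

ordered = sort_queue(queue)
print([(t.job_id, t.array_task_id) for t in ordered])
# prints [(101, 0), (100, 1), (100, 2), (100, 3)]
```

Using a composite sort key keeps the priority comparison primary, so the task ID only matters among tasks whose priorities are identical.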
    
    * Changes in Slurm 14.11.8
    ==========================
     -- Eliminate need for user to set user_id on job_update calls.
     -- Correct list of unavailable nodes reported in a job's "reason" field when
        that job cannot start.
     -- Map job --mem-per-cpu=0 to --mem=0.
     -- Fix squeue -o %m and %d unit conversion to Megabytes.
     -- Fix issue with incorrect time calculation in the priority plugin when
        a job runs past its time limit.