    This file describes changes in recent versions of SLURM. It primarily
    documents those changes that are of interest to users and admins.
    
    * Changes in SLURM 1.4.0-pre6
    =============================
     -- Fix job preemption when sched/gang and select/linear are configured with
        non-sharing partitions.
     -- In select/cons_res, ensure that required nodes have available resources.
    
    * Changes in SLURM 1.4.0-pre5
    =============================
     -- Correction in setting of SLURM_CPU_BIND environment variable.
     -- Rebuild slurmctld's job select_jobinfo->node_bitmap on restart/reconfigure
        of the daemon rather than restoring the bitmap since the nodes in a system
        can change (be added or removed).
     -- Add configuration option "--with-cpusetdir=PATH" for non-standard
        cpuset directory locations.
     -- Get new multi-core data structures working on BlueGene systems.
     -- Modify PMI_Get_clique_ranks() to return an array of integers rather
        than a char * to satisfy the PMI standard. Correct logic in
        PMI_Get_clique_size() for when srun's --overcommit option is used.
     -- Fix bug in select/cons_res: allocate a job all of the processors on a
        node when the --exclusive option is specified as a job submit option.
     -- Add NUMA cpu_bind support to the task affinity plugin. Binds tasks to
        a set of CPUs that belong to a NUMA locality domain with the appropriate
        --cpu-bind option (ldoms, rank_ldom, map_ldom, and mask_ldom). See
        "man srun" for more information.
    
    * Changes in SLURM 1.4.0-pre4
    =============================
     -- For task/affinity, force jobs to use a particular task binding by setting
        the TaskPluginParam configuration parameter rather than slurmd's
        SLURM_ENFORCED_CPU_BIND environment variable.
     -- Enable full preemption of jobs by partition with select/cons_res 
        (cons_res_preempt.patch from Chris Holmes, HP).
     -- Add configuration parameter DebugFlags to provide detailed logging for
        specific subsystems (steps and triggers so far).
     -- srun's --no-kill option is passed to slurmctld so that a job step is
        killed even if the node where srun executes goes down, unless the
        --no-kill option is used. Previous termination logic would fail if
        srun was not responding.
     -- Transfer a job step's core bitmap from the slurmctld to the slurmd
        within the job step credential.
     -- Add cpu_bind, cpu_bind_type, mem_bind and mem_bind_type to job allocation
        request and job_details structure in slurmctld. Add support for the
        --cpu_bind and --mem_bind options to the salloc and sbatch commands.
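
Several entries above concern CPU and memory binding. As a rough illustration
only, binding masks are conventionally written as hexadecimal bit masks in
which bit i selects entry i (a CPU, or a locality domain for the ldom
variants). The sketch below decodes such a mask. The helper name
ids_in_mask is hypothetical and is not part of SLURM; the exact mask syntax
each option accepts is described in "man srun".

```python
def ids_in_mask(mask: str) -> list[int]:
    """Return the indices of the set bits in a hexadecimal mask string.

    Hypothetical helper for illustration; not a SLURM function.
    """
    value = int(mask, 16)  # base 16 also accepts an optional "0x" prefix
    ids = []
    bit = 0
    while value:
        if value & 1:      # lowest bit set: entry number `bit` is selected
            ids.append(bit)
        value >>= 1
        bit += 1
    return ids


if __name__ == "__main__":
    print(ids_in_mask("0x5"))
    print(ids_in_mask("f0"))
```

Under this convention a mask of 0x5 selects entries 0 and 2, and f0 selects
entries 4 through 7.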
    
    * Changes in SLURM 1.4.0-pre3
    =============================
     -- Internal changes: CPUs per node changed from 32-bit to 16-bit size.
        Node count fields changed from 16-bit to 32-bit size in some structures.
     -- Remove select plugin functions select_p_get_extra_jobinfo(),
        select_p_step_begin() and select_p_step_fini().
     -- Remove the following slurmctld job structure fields: num_cpu_groups,
        cpus_per_node, cpu_count_reps, alloc_lps_cnt, alloc_lps, and used_lps.
        Use equivalent fields in new "select_job" structure, which is filled
        in by the select plugins.
     -- Modify mem_per_task in job step request from 16-bit to 32-bit size.
        Use new "select_job" structure for the job step's memory management.
     -- Add core_bitmap_job to slurmctld's job step structure to identify
        which specific cores are allocated to the step.
     -- Add new configuration option OverTimeLimit to permit jobs to exceed 
        their (soft) time limit by a configurable amount. Backfill scheduling
        will be based upon the soft time limit.
     -- Remove select_g_get_job_cores(). That data is now within the slurmctld's
        job structure.
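
The OverTimeLimit entry above can be pictured with a small sketch: the
scheduler plans around the soft time limit, while a job is only cancelled
once it exceeds that limit by more than the configured allowance. The
function names, the use of minutes as the unit, and the strict ">"
comparison are assumptions made for illustration and are not taken from the
SLURM source.

```python
def planning_limit(soft_limit_min: int) -> int:
    """Limit used for backfill planning: the soft time limit itself."""
    return soft_limit_min


def cancel_limit(soft_limit_min: int, over_time_limit_min: int) -> int:
    """Run time after which the job is cancelled (assumed: soft + allowance)."""
    return soft_limit_min + over_time_limit_min


def should_cancel(elapsed_min: int, soft_limit_min: int,
                  over_time_limit_min: int) -> bool:
    """True once the job has run past its soft limit plus the allowance."""
    return elapsed_min > cancel_limit(soft_limit_min, over_time_limit_min)


if __name__ == "__main__":
    # A job with a 60-minute soft limit and a 10-minute allowance.
    for elapsed in (55, 65, 75):
        print(elapsed, should_cancel(elapsed, 60, 10))
```

With a 60-minute soft limit and a 10-minute allowance, a job at 65 minutes
is over its soft limit but still running, and a job at 75 minutes is
cancelled.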
    
    * Changes in SLURM 1.4.0-pre2
    =============================
     -- Remove srun's --ctrl-comm-ifhn-addr option (for PMI/MPICH2). It is no