Skip to content
Snippets Groups Projects
  1. Apr 30, 2014
    • Morris Jette's avatar
      switch/nrt - CAU and RMDA tracking correction · 6f66fdef
      Morris Jette authored
      Switch/nrt - Properly track usage of CAU and RDMA resources with multiple
      tasks per compute node. Previous logic would allocate resources once per
      task and then deallocate once per node, leaking CMA and RDMA resources
      and preventing their use by future jobs.
      6f66fdef
    • Morris Jette's avatar
      ignore prio reset on held jobs · cbcea672
      Morris Jette authored
      If a job is held, then only release it with the "scontrol release <jobid>"
      command rather than a simple reset of the job's priority. This is needed to
      support job arrays better. Otherwise a priority reset of a job array
      would free all requeued/held jobs from that job array rather than
      leaving them held.
      cbcea672
  2. Apr 28, 2014
  3. Apr 26, 2014
  4. Apr 25, 2014
  5. Apr 24, 2014
  6. Apr 23, 2014
  7. Apr 22, 2014
  8. Apr 21, 2014
  9. Apr 19, 2014
  10. Apr 18, 2014
    • Morris Jette's avatar
      switch/nrt - free partial allocation · a197a1da
      Morris Jette authored
      On switch resource allocation failure, free partial allocation.
      Failure mode was CAU could be allocated on some nodes, but not
      others. The CAU allocated on nodes and switches up to the failure
      point were never released.
      a197a1da
    • Morris Jette's avatar
      Job array scheduling bug · c075ac75
      Morris Jette authored
      Don't block scheduling of entire job array if it could run in multiple
      partitions.
      bug 726
      c075ac75
  11. Apr 17, 2014
  12. Apr 16, 2014
Loading