Skip to content
Snippets Groups Projects
  1. Sep 17, 2016
    • Morris Jette's avatar
      Restore ability to manually power down nodes · da722a89
      Morris Jette authored
      Restore ability to manually power down nodes, broken in 15.08.12
      in commit b4904661
      
      The patch introduced in commit b4904661 (not powering down dead node) has a bad side effect.  Adding the "(node_ptr->last_idle != 0)" condition prevents from powering down nodes with the following command:
      
      scontrol update nodename=nX state=power_down
      
      because the state update function relies on zeroing the "last_idle" variable when a power_down is requested (see src/slurmctld/node_mgr.c, line 1589).
      
      Reverting this commit should solve the problem...but I let you decide...
      
      Didier GAZEN
      da722a89
  2. Sep 16, 2016
  3. Sep 15, 2016
  4. Sep 14, 2016
  5. Sep 09, 2016
  6. Sep 08, 2016
    • Morris Jette's avatar
      Restructure srun task_exit logic · 6b6d4e1a
      Morris Jette authored
      Restructure srun command locking for task_exit processing logic for improved
        parallelism. This change decreases the amount of time consumed by serial
        logic by 2 orders of magnitude.
      bug 3044
      6b6d4e1a
  7. Sep 07, 2016
    • Morris Jette's avatar
      Preserve node "RESERVATION" state · 5eee1d28
      Morris Jette authored
      Preserve node "RESERVATION" state when one of multiple overlapping
          reservations ends. Previous logic would clear the node's
          RESERVATION state flag when any one of the reservations on the
          node ended rather than keeping the node in RESERVATION state
          until the last reservation ended.
      bug 3057
      5eee1d28
    • Morris Jette's avatar
      Handle slurmctld restart while compute node reboot request in progress · 4517c454
      Morris Jette authored
      Handle case when slurmctld daemon restart while compute node reboot in
          progress. Return node to service rather than setting DOWN.
      bug 3042
      4517c454
  8. Sep 06, 2016
  9. Sep 02, 2016
  10. Sep 01, 2016
  11. Aug 30, 2016
  12. Aug 27, 2016
  13. Aug 26, 2016
  14. Aug 25, 2016
    • Morris Jette's avatar
      Corrections to gres.conf parsing logic · dbfd87e4
      Morris Jette authored
      If all GRES were not defined on all nodes OR if a regular expression was used
         for a GRES file configuration (e.g. in gres.conf
         "Type=gpu Files=/dev/nvidia[0-4]"), then memory corruption was likely.
         The logic has been bad since its inception several years ago.
      dbfd87e4
  15. Aug 24, 2016
  16. Aug 23, 2016
  17. Aug 22, 2016
  18. Aug 20, 2016
  19. Aug 19, 2016
  20. Aug 17, 2016
  21. Aug 16, 2016
  22. Aug 15, 2016
  23. Aug 12, 2016
  24. Aug 11, 2016
Loading