Skip to content
Snippets Groups Projects
  1. Oct 05, 2016
  2. Oct 04, 2016
    • Morris Jette's avatar
      add knl.conf parameter CapmcRetries · 5cb90497
      Morris Jette authored
      Add new knl.conf configuration parameter CapmcRetries
      Modify capmc_suspend and capmc_resume to retry operations when
        Cray State Manager is down.
      Add retry logic to node_features/knl_cray to handle Cray State
        manager being down.
      bug 3100
      5cb90497
  3. Oct 03, 2016
  4. Sep 30, 2016
  5. Sep 29, 2016
  6. Sep 28, 2016
  7. Sep 27, 2016
  8. Sep 26, 2016
  9. Sep 24, 2016
  10. Sep 23, 2016
  11. Sep 22, 2016
  12. Sep 21, 2016
  13. Sep 20, 2016
  14. Sep 17, 2016
    • Morris Jette's avatar
      Restore ability to manually power down nodes · da722a89
      Morris Jette authored
      Restore ability to manually power down nodes, broken in 15.08.12
      in commit b4904661
      
      The patch introduced in commit b4904661 (not powering down dead node) has a bad side effect.  Adding the "(node_ptr->last_idle != 0)" condition prevents from powering down nodes with the following command:
      
      scontrol update nodename=nX state=power_down
      
      because the state update function relies on zeroing the "last_idle" variable when a power_down is requested (see src/slurmctld/node_mgr.c, line 1589).
      
      Reverting this commit should solve the problem...but I let you decide...
      
      Didier GAZEN
      da722a89
  15. Sep 16, 2016
  16. Sep 15, 2016
  17. Sep 14, 2016
  18. Sep 09, 2016
  19. Sep 08, 2016
    • Morris Jette's avatar
      Restructure srun task_exit logic · 6b6d4e1a
      Morris Jette authored
      Restructure srun command locking for task_exit processing logic for improved
        parallelism. This change decreases the amount of time consumed by serial
        logic by 2 orders of magnitude.
      bug 3044
      6b6d4e1a
Loading