Skip to content
Snippets Groups Projects
  1. May 06, 2016
    • John Thiltges's avatar
      Fix for slurmstepd setfault · db0fe22e
      John Thiltges authored
      With slurm-15.08.10, we're seeing occasional segfaults in slurmstepd. The logs point to the following line: slurm-15.08.10/src/slurmd/slurmstepd/mgr.c:2612
      
      On that line, _get_primary_group() is accessing the results of getpwnam_r():
          *gid = pwd0->pw_gid;
      
      If getpwnam_r() cannot find a matching password record, it will set the result (pwd0) to NULL, but still return 0. When the pointer is accessed, it will cause a segfault.
      
      Checking the result variable (pwd0) to determine success should fix the issue.
      db0fe22e
  2. May 05, 2016
  3. May 04, 2016
  4. May 03, 2016
  5. May 02, 2016
  6. Apr 29, 2016
  7. Apr 28, 2016
  8. Apr 27, 2016
  9. Apr 26, 2016
  10. Apr 23, 2016
  11. Apr 20, 2016
  12. Apr 15, 2016
  13. Apr 14, 2016
    • Morris Jette's avatar
      Set burst buffer reason for job · 49d483db
      Morris Jette authored
      If a job fails stage in, set its reason to BurstBufferOperation
      with a string describing what happened. Previously the reason was
      set to AdminHeld on stage-in failure.
      49d483db
  14. Apr 13, 2016
Loading