  1. May 09, 2016
  2. May 06, 2016
      Add another explanation for test failure · b5dabfe8
      Morris Jette authored
      Fix for slurmstepd segfault · db0fe22e
      John Thiltges authored
      With slurm-15.08.10, we're seeing occasional segfaults in slurmstepd. The logs point to the following line: slurm-15.08.10/src/slurmd/slurmstepd/mgr.c:2612
      
      On that line, _get_primary_group() is accessing the results of getpwnam_r():
          *gid = pwd0->pw_gid;
      
      If getpwnam_r() cannot find a matching password record, it sets the result pointer (pwd0) to NULL but still returns 0. Dereferencing that NULL pointer then causes the segfault.
      
      Checking the result variable (pwd0) to determine success should fix the issue.
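The check described in the commit message above can be sketched as follows. This is a minimal illustration, not the actual Slurm patch: the helper name `lookup_primary_gid`, the fallback buffer size, and the return convention are assumptions made for the example. Only `getpwnam_r()`, `sysconf()`, and the `pw_gid` field come from the standard POSIX interface.

```c
#include <errno.h>
#include <pwd.h>
#include <stdlib.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper (not the Slurm source): look up the primary group
 * of user_name. Returns 0 and sets *gid on success, -1 on any failure,
 * including "no such user".
 */
static int lookup_primary_gid(const char *user_name, gid_t *gid)
{
	struct passwd pwd;
	struct passwd *result = NULL;
	long buf_len = sysconf(_SC_GETPW_R_SIZE_MAX);
	char *buf;
	int rc;

	if (buf_len <= 0)
		buf_len = 16384;	/* assumed fallback when no limit is reported */
	buf = malloc((size_t) buf_len);
	if (!buf)
		return -1;

	/* Retry only if the call was interrupted by a signal. */
	do {
		rc = getpwnam_r(user_name, &pwd, buf, (size_t) buf_len, &result);
	} while (rc == EINTR);

	/*
	 * A return value of 0 is not enough: when no record matches,
	 * getpwnam_r() may return 0 and leave result set to NULL.
	 * Test both before touching result->pw_gid.
	 */
	if (rc != 0 || result == NULL) {
		free(buf);
		return -1;
	}

	*gid = result->pw_gid;
	free(buf);
	return 0;
}
```

Testing the result pointer rather than only the return code means a missing user and a genuine lookup error both take the same safe failure path, and `*gid` is left untouched in either case.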
  3. May 05, 2016
  4. May 04, 2016
  5. May 03, 2016
  6. May 02, 2016
  7. Apr 29, 2016
  8. Apr 28, 2016
  9. Apr 27, 2016
  10. Apr 26, 2016
  11. Apr 23, 2016
  12. Apr 20, 2016
  13. Apr 15, 2016
  14. Apr 14, 2016
      Set burst buffer reason for job · 49d483db
      Morris Jette authored
      If a job fails stage in, set its reason to BurstBufferOperation
      with a string describing what happened. Previously the reason was
      set to AdminHeld on stage-in failure.