Skip to content
Snippets Groups Projects
  1. Oct 20, 2004
  2. Oct 16, 2004
  3. Oct 06, 2004
  4. Oct 05, 2004
  5. Sep 28, 2004
  6. Sep 27, 2004
  7. Sep 23, 2004
  8. Aug 24, 2004
  9. Aug 23, 2004
  10. Aug 18, 2004
  11. Aug 04, 2004
  12. Jul 09, 2004
  13. Jun 23, 2004
  14. May 11, 2004
  15. May 03, 2004
  16. Apr 30, 2004
  17. Apr 19, 2004
  18. Mar 11, 2004
  19. Feb 14, 2004
  20. Feb 13, 2004
  21. Feb 12, 2004
  22. Jan 30, 2004
  23. Dec 31, 2003
  24. Dec 08, 2003
  25. Nov 21, 2003
    • Mark Grondona's avatar
      o Merged fixes from slurm-0-2-branch at slurm-0-2-22-1 · e3716463
      Mark Grondona authored
         - fixes to help ensure slurmd uses the same key for shared memory
           on a restart (to avoid losing track of jobs)
         - slurmd only runs one launch thread at a time
         - fix bug in slurmd where multiple threads used same address space
           for connecting client address.
         - srun always sends SIGKILL to job step before issuing complete request
         - Changed short string for draining nodes to drng from drain.
         - srun default launch message timeout increased to 5s.
      e3716463
  26. Nov 18, 2003
  27. Nov 17, 2003
  28. Nov 14, 2003
  29. Nov 10, 2003
  30. Nov 07, 2003
  31. Nov 05, 2003
  32. Oct 29, 2003
    • Moe Jette's avatar
      Slurmctld now pings srun periodically. If srun fails to respond, the job · e80b2442
      Moe Jette authored
      and/or job step(s) will have their resources de-allocated and be killed.
      A resource allocation will not be release unless no job steps are active
      for at least InactiveLimit seconds. DPCS jobs will be subject to this
      forced de-allocation if they remain inactive for an extended period of
      time, which can get SLURM and DPCS back in sync if DPCS does a cold-start.
      e80b2442
  33. Oct 24, 2003
    • Moe Jette's avatar
      Remove _too_many_fragments() function since DPCS is now smart enough to · cbe21ab0
      Moe Jette authored
      avoid highly fragmented resource allocations.
      Add list of excluded nodes to job info dumpped and reported.
      Fix how mis-matched RPC version number are handled. Let error code get
      back to the API function.
      Dump job state information upon each job's termination via plugin.
      Re-issue incomplete write requests in job/partition state save.
      Make slurmctld continue proper operation without any default partition
      (gnats:317).
      Add command/RPC to delete a partition.
      Retry socket connection for slurmd/io.c as needed (gnats:253).
      cbe21ab0
  34. Oct 08, 2003
  35. Oct 01, 2003
  36. Sep 03, 2003
Loading