Skip to content
Snippets Groups Projects
  1. Jan 30, 2004
  2. Dec 31, 2003
  3. Dec 08, 2003
  4. Nov 21, 2003
    • Mark Grondona's avatar
      o Merged fixes from slurm-0-2-branch at slurm-0-2-22-1 · e3716463
      Mark Grondona authored
         - fixes to help ensure slurmd uses the same key for shared memory
           on a restart (to avoid losing track of jobs)
         - slurmd only runs one launch thread at a time
         - fix bug in slurmd where multiple threads used same address space
           for connecting client address.
         - srun always sends SIGKILL to job step before issuing complete request
         - Changed short string for draining nodes to drng from drain.
         - srun default launch message timeout increased to 5s.
      e3716463
  5. Nov 18, 2003
  6. Nov 17, 2003
  7. Nov 14, 2003
  8. Nov 10, 2003
  9. Nov 07, 2003
  10. Nov 05, 2003
  11. Oct 29, 2003
    • Moe Jette's avatar
      Slurmctld now pings srun periodically. If srun fails to respond, the job · e80b2442
      Moe Jette authored
      and/or job step(s) will have their resources de-allocated and be killed.
      A resource allocation will not be release unless no job steps are active
      for at least InactiveLimit seconds. DPCS jobs will be subject to this
      forced de-allocation if they remain inactive for an extended period of
      time, which can get SLURM and DPCS back in sync if DPCS does a cold-start.
      e80b2442
  12. Oct 24, 2003
    • Moe Jette's avatar
      Remove _too_many_fragments() function since DPCS is now smart enough to · cbe21ab0
      Moe Jette authored
      avoid highly fragmented resource allocations.
      Add list of excluded nodes to job info dumpped and reported.
      Fix how mis-matched RPC version number are handled. Let error code get
      back to the API function.
      Dump job state information upon each job's termination via plugin.
      Re-issue incomplete write requests in job/partition state save.
      Make slurmctld continue proper operation without any default partition
      (gnats:317).
      Add command/RPC to delete a partition.
      Retry socket connection for slurmd/io.c as needed (gnats:253).
      cbe21ab0
  13. Oct 08, 2003
  14. Oct 01, 2003
  15. Sep 03, 2003
  16. Jul 30, 2003
  17. Jul 10, 2003
  18. Jul 04, 2003
  19. Mar 27, 2003
  20. Mar 24, 2003
  21. Mar 21, 2003
  22. Mar 06, 2003
  23. Mar 04, 2003
  24. Jan 31, 2003
  25. Jan 10, 2003
  26. Jan 09, 2003
  27. Dec 18, 2002
  28. Nov 22, 2002
  29. Nov 12, 2002
  30. Oct 24, 2002
  31. Oct 23, 2002
Loading