  1. Sep 29, 2004
  2. Sep 27, 2004
  3. Sep 24, 2004
  4. Sep 21, 2004
  5. Sep 20, 2004
  6. Sep 17, 2004
  7. Sep 15, 2004
  8. Aug 20, 2004
  9. Aug 19, 2004
  10. Aug 17, 2004
    • Add logic to support Blue Gene/L: · 9bd07b7b
      Moe Jette authored
      Map all nodes in the cluster to a single front-end node.
      Don't repeat ping/register/kill/etc. RPCs to all pseudo nodes, just the
        front-end.
      Treat a single message for some RPCs as representing all nodes in the
        cluster: register, ping response, epilog complete, etc.
  11. Aug 10, 2004
  12. Aug 06, 2004
  13. Aug 04, 2004
  14. Jul 26, 2004
  15. Jul 23, 2004
  16. Jul 09, 2004
  17. Jun 02, 2004
  18. Apr 30, 2004
  19. Mar 20, 2004
  20. Mar 16, 2004
  21. Mar 11, 2004
  22. Jan 26, 2004
  23. Dec 31, 2003
  24. Dec 11, 2003
  25. Dec 05, 2003
    • If nodes vanish on a reconfig or slurmctld restart then: · 7ef7afe6
      Moe Jette authored
      gracefully kill all jobs allocated resources on those nodes,
      gracefully kill all pending jobs that require those nodes,
      leave pending jobs that exclude those nodes, but ignore those nodes.
      Added "best_effort" argument to node_name2bitmap() function.
      Fix potential memory leak when the Maui scheduler interface resets
      the required nodes.
      (gnats:342)
  26. Nov 25, 2003
  27. Nov 18, 2003
  28. Nov 13, 2003
  29. Nov 07, 2003
  30. Oct 29, 2003
    • Slurmctld now pings srun periodically. If srun fails to respond, the job · e80b2442
      Moe Jette authored
      and/or job step(s) will have their resources de-allocated and be killed.
      A resource allocation is released only after no job steps have been
      active for at least InactiveLimit seconds. DPCS jobs will be subject to
      this forced de-allocation if they remain inactive for an extended period
      of time, which can get SLURM and DPCS back in sync if DPCS does a
      cold-start.
  31. Oct 24, 2003
    • Remove _too_many_fragments() function since DPCS is now smart enough to · cbe21ab0
      Moe Jette authored
      avoid highly fragmented resource allocations.
      Add list of excluded nodes to the job info dumped and reported.
      Fix how mismatched RPC version numbers are handled. Let the error code
      get back to the API function.
      Dump job state information upon each job's termination via plugin.
      Re-issue incomplete write requests in job/partition state save.
      Make slurmctld continue proper operation without any default partition
      (gnats:317).
      Add command/RPC to delete a partition.
      Retry socket connection for slurmd/io.c as needed (gnats:253).
    • Add --uid and --gid options to srun · fc50746d
      jwindley authored
  32. Oct 11, 2003
  33. Oct 03, 2003
  34. Sep 29, 2003
  35. Sep 23, 2003
  36. Sep 21, 2003