Skip to content
Snippets Groups Projects
  1. Aug 18, 2004
  2. Aug 17, 2004
    • Moe Jette's avatar
      Add logic to support Blue Gene/L: · 9bd07b7b
      Moe Jette authored
      Map all nodes in cluster to a single front-end node.
      Don't repeat ping/register/kill/etc. RPCs to all pseudo nodes, just the
        front-end.
      Treat single message for some RPCs as representing all nodes in the
        cluster: register, ping responce, epilog complete, etc.
      9bd07b7b
  3. Jun 22, 2004
  4. Jun 09, 2004
  5. Jun 04, 2004
  6. Jun 02, 2004
  7. Apr 30, 2004
  8. Apr 23, 2004
    • Moe Jette's avatar
      Add support for AIX systems: · 55e62ab4
      Moe Jette authored
      * Memory leak in slurm_cred.c, added EVP_MD_CTX_cleanup().
      * Pthread stack size too small on AIX. Resulting in stack corruption
        and ugly failure modes. Added slurm_attr_init to macros.h to
        explicitly set the stack size  for all pthreads.
      * /dev/urandom not present on AIX, use rand() as needed instead in
        constructing a credential. Used in "srun --join".
      * getsockopt(Socket, Level, SO_ERROR, &err, OptionLenght) sometime
        returns an error code of -1. This causes an assert failure in
        slurmd/io.c:_update_error_state().
      * Function aliasing is not working on AIX. It is being turned off
        via a variable in config.h and "#if" logic in macros.h and
        slurm_xlator.h.
      * dlopen failing if plugins reference any functions not present in
        caller. This may be fixed with the LDFLAG "-Wl,-bgcbypass=1000"
        being added for the slurm commands (avoid garbage collection of
        unused functions).
      * read() is sometimes generates EAGAIN error, which was not handled
        in some places.
      * vsnprintf() for string NULL is printing "" instead of "(null)"
        as produced by snprintf(). More format printing was added to log.c
        to produce more consistent log messages.
      * poll() takes a timeout of -1 for unlimited rather than any negative
        number. Modify logic that was always multiplying by 1000 to convert
        usec to msec.
      * getopt_long keyword table was not NULL terminated, resulting in
        segfault with invalid command-line argument in most commands.
      * xmalloc module assert failures were not generating a core file.
        Changed "fatal();abort();" to "error();abort();".
      * Change msg timeout from 3 sec to 5 sec. Running everything on single
        AIX node was very slow.
      55e62ab4
  9. Apr 05, 2004
  10. Mar 16, 2004
  11. Mar 04, 2004
  12. Feb 26, 2004
  13. Jan 26, 2004
  14. Jan 23, 2004
  15. Dec 31, 2003
  16. Dec 24, 2003
  17. Dec 23, 2003
    • Moe Jette's avatar
      Fix state read to deal with errors better. · 7b2428f5
      Moe Jette authored
      Fix update node RPC to handle reason field change without state change.
        State was being handled as type int instead of uint16_t so NO_VAL check
        was not working properly.
      7b2428f5
  18. Dec 22, 2003
  19. Dec 11, 2003
  20. Dec 05, 2003
    • Moe Jette's avatar
      If nodes vanish on a reconfig or slurmctld restart then: · 7ef7afe6
      Moe Jette authored
      gracefully kill all jobs allocated resources on those nodes,
      gracefully kill all pending jobs that require those nodes,
      leave pending jobs that exclude those nodes but ignore those nodes.
      Added "best_effort" argument to node_name2bitmap() function.
      Fix potential memory leak when maui scheduler interface resets
      the required nodes.
      (gnats:342)
      7ef7afe6
  21. Nov 21, 2003
  22. Nov 19, 2003
  23. Nov 18, 2003
  24. Nov 07, 2003
  25. Nov 06, 2003
  26. Oct 30, 2003
  27. Oct 24, 2003
  28. Oct 11, 2003
  29. Oct 10, 2003
  30. Oct 08, 2003
  31. Sep 29, 2003
  32. Sep 25, 2003
  33. Sep 23, 2003
  34. Sep 21, 2003
  35. Sep 19, 2003
Loading