- Dec 06, 2005
-
-
Moe Jette authored
-
- Nov 11, 2005
-
-
Moe Jette authored
(still needs some work for scalability).
-
- Aug 12, 2005
-
-
Christopher J. Morrone authored
-
- Apr 23, 2004
-
-
Moe Jette authored
* Memory leak in slurm_cred.c, added EVP_MD_CTX_cleanup().
* Pthread stack size too small on AIX, resulting in stack corruption and ugly failure modes. Added slurm_attr_init to macros.h to explicitly set the stack size for all pthreads.
* /dev/urandom not present on AIX, use rand() as needed instead in constructing a credential. Used in "srun --join".
* getsockopt(Socket, Level, SO_ERROR, &err, OptionLength) sometimes returns an error code of -1. This causes an assert failure in slurmd/io.c:_update_error_state().
* Function aliasing is not working on AIX. It is being turned off via a variable in config.h and "#if" logic in macros.h and slurm_xlator.h.
* dlopen fails if plugins reference any functions not present in the caller. This may be fixed by adding the LDFLAG "-Wl,-bgcbypass=1000" for the slurm commands (avoids garbage collection of unused functions).
* read() sometimes generates an EAGAIN error, which was not handled in some places.
* vsnprintf() with a NULL string prints "" instead of "(null)" as produced by snprintf(). More format printing was added to log.c to produce more consistent log messages.
* poll() takes a timeout of -1 for unlimited rather than any negative number. Modified logic that was always multiplying by 1000 to convert usec to msec.
* getopt_long keyword table was not NULL terminated, resulting in a segfault on an invalid command-line argument in most commands.
* xmalloc module assert failures were not generating a core file. Changed "fatal();abort();" to "error();abort();".
* Changed msg timeout from 3 sec to 5 sec. Running everything on a single AIX node was very slow.
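The getopt_long item above is easy to reproduce in isolation. The following is a minimal sketch, not the SLURM option table: the option names and the helper parse_first_opt() are hypothetical. It shows the all-zero sentinel entry that getopt_long() needs in order to stop scanning the table when it meets an unknown option.

```c
#include <assert.h>
#include <getopt.h>
#include <stddef.h>

/* Hypothetical option table for illustration only. The final all-zero
 * entry is the terminator: without it, getopt_long() walks past the end
 * of the array while searching for an unrecognized long option. */
static const struct option long_opts[] = {
    { "help",    no_argument, NULL, 'h' },
    { "verbose", no_argument, NULL, 'v' },
    { NULL,      0,           NULL, 0   }   /* required sentinel */
};

/* Parse only the first option of argv and return getopt_long()'s result. */
static int parse_first_opt(int argc, char **argv)
{
    assert(argv != NULL);
    optind = 1;   /* restart scanning at the first argument */
    opterr = 0;   /* suppress getopt's own diagnostic output */
    return getopt_long(argc, argv, "hv", long_opts, NULL);
}
```

With the sentinel in place, an unknown option such as "--bogus" is reported through the normal '?' return value instead of crashing the command.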
-
- Jan 17, 2004
-
-
Moe Jette authored
alloca() has been replaced by malloc() and free(), use of __progname eliminated, GNU compliant malloc() wrapper added, RLIMIT_* definition use controlled by #ifdef's, changed some data types, and conditionally added some #includes.
-
- Oct 23, 2003
-
-
Moe Jette authored
from slurm_open_stream (which is only called by slurmd/io.c to connect stdin/out/err). (gnats:253)
-
- Oct 22, 2003
-
-
Moe Jette authored
is zero.
-
Mark Grondona authored
-
- Oct 13, 2003
- Oct 10, 2003
-
-
Moe Jette authored
-
- Sep 19, 2003
- Sep 11, 2003
-
-
Moe Jette authored
-
- Sep 05, 2003
-
-
Moe Jette authored
of socket communications. Previously, a legitimate SLURM error code was sometimes overwritten with the fcntl error code EINTR.
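The truncated message does not show how the overwrite was avoided. One common approach, sketched here under that assumption, is to save errno before the cleanup fcntl() calls and restore it afterwards. The helper name set_blocking_preserve_errno() is hypothetical and is not taken from the SLURM source.

```c
#include <assert.h>
#include <errno.h>
#include <fcntl.h>

/* Hypothetical helper: switch fd back to blocking mode without letting
 * a failure in fcntl() (for example EINTR) replace the errno value that
 * an earlier send/recv call already stored. */
static int set_blocking_preserve_errno(int fd)
{
    int saved_errno = errno;   /* error code the caller still wants */
    int rc = -1;
    int flags = fcntl(fd, F_GETFL, 0);

    if (flags != -1)
        rc = fcntl(fd, F_SETFL, flags & ~O_NONBLOCK);

    errno = saved_errno;       /* restore the original error code */
    return rc;                 /* -1 still signals that fcntl failed */
}
```

The trade-off is that the caller learns that fcntl() failed from the return value but not why, which is acceptable when the earlier error is the one worth reporting.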
-
Moe Jette authored
-
Moe Jette authored
occur naturally if an srun, scontrol, scancel, sinfo, or squeue command is killed by the user while a communication to slurmctld is in progress. This seems to occur fairly regularly as part of batch job termination.
-
Moe Jette authored
-
Moe Jette authored
send/receive, function (poll, timeout, send, recv, etc), and the error message are all reported.
-
- Jul 09, 2003
-
-
Moe Jette authored
specifically to slurm controller.
-
- May 07, 2003
-
-
Mark Grondona authored
o Fix -Wall warnings
-
Mark Grondona authored
slurm_protocol_socket_implementation.c
o Added new functions slurm_send_recv*rc_msg() for handling the common mode of sending a message and receiving only a "return code" message.
o Overhauled existing code and performed general cleanup.
-
- Apr 04, 2003
-
-
Moe Jette authored
-
- Feb 14, 2003
-
-
Mark Grondona authored
o srun/ : place temporary fix for stdout nonblocking I/O problem
-
- Dec 14, 2002
-
-
Moe Jette authored
interrupts were resetting timeout in I/O functions with timeout.
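A usual way to keep a signal from restarting the full timeout is to fix an absolute deadline once and recompute the remaining time after each EINTR. The sketch below illustrates that idea with poll(); the helper names now_ms() and poll_with_deadline() are hypothetical, and the actual SLURM change may differ.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <errno.h>
#include <poll.h>
#include <time.h>

/* Monotonic clock reading in milliseconds. */
static long now_ms(void)
{
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return (long)ts.tv_sec * 1000L + ts.tv_nsec / 1000000L;
}

/* Hypothetical helper: poll one descriptor for at most timeout_ms in
 * total, even if the call is interrupted by signals several times. */
static int poll_with_deadline(struct pollfd *pfd, int timeout_ms)
{
    assert(pfd != NULL && timeout_ms >= 0);
    long deadline = now_ms() + timeout_ms;   /* fixed once, up front */

    for (;;) {
        long left = deadline - now_ms();
        if (left < 0)
            left = 0;                        /* deadline already passed */
        int rc = poll(pfd, 1, (int)left);
        if (rc >= 0 || errno != EINTR)
            return rc;                       /* ready, timeout, or real error */
        /* EINTR: loop and wait only for the time that remains. */
    }
}
```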
-
- Dec 05, 2002
-
-
Mark Grondona authored
still remain.
o cbuf.[ch], list.[ch] : Checked in new revisions from lsd-tools
o srun/io.c,msg.c : added io_thr_create(), msg_thr_create() to simplify code in main().
o Other minor fixes/changes
-
- Nov 21, 2002
-
-
Moe Jette authored
-
- Nov 20, 2002
-
-
Mark Grondona authored
o fix _STMT_START in macros.h
o remove annoying "Interrupted system call" debug error for accept() with slurm_protocol_socket_implementation.c
o fix some assert macros in xassert and xmalloc
-
- Nov 15, 2002
-
-
Moe Jette authored
from conman.
-
- Nov 07, 2002
-
-
Moe Jette authored
than 1024 file descriptors the select function call silently fails and corrupts memory.
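The memory corruption comes from FD_SET() writing outside the fixed-size fd_set bitmap when the descriptor number is FD_SETSIZE (commonly 1024) or larger. The message does not say how the code was changed, so the following is only a sketch of one defensive option: a hypothetical safe_fd_set() wrapper that rejects out-of-range descriptors. Replacing select() with poll() is the other common remedy.

```c
#include <assert.h>
#include <stddef.h>
#include <sys/select.h>

/* Hypothetical guard around FD_SET(): an fd_set holds only descriptors
 * in the range [0, FD_SETSIZE), and FD_SET() performs no bounds check. */
static int safe_fd_set(int fd, fd_set *set)
{
    assert(set != NULL);
    if (fd < 0 || fd >= FD_SETSIZE)
        return -1;       /* would write outside the bitmap */
    FD_SET(fd, set);
    return 0;
}
```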
-
- Nov 01, 2002
-
-
Moe Jette authored
bad message. Functions now return SLURM_ERROR on bad unpack instead of assert failure.
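Returning an error code from an unpack routine, rather than asserting, might look like the sketch below. The names unpack_u32(), EX_SUCCESS, and EX_ERROR are hypothetical stand-ins and are not the SLURM pack API or its SLURM_ERROR constant.

```c
#include <assert.h>
#include <arpa/inet.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define EX_SUCCESS 0      /* hypothetical stand-ins for the real codes */
#define EX_ERROR   (-1)

/* Read one big-endian 32-bit value at *offset. A short or malformed
 * message yields EX_ERROR instead of aborting the daemon. */
static int unpack_u32(const char *buf, size_t size, size_t *offset,
                      uint32_t *out)
{
    uint32_t net;

    assert(buf != NULL && offset != NULL && out != NULL);
    if (*offset > size || size - *offset < sizeof(net))
        return EX_ERROR;              /* not enough bytes left */
    memcpy(&net, buf + *offset, sizeof(net));
    *out = ntohl(net);                /* network to host byte order */
    *offset += sizeof(net);
    return EX_SUCCESS;
}
```

Only programming errors (NULL arguments) still trip an assert here; bad input from the network is reported to the caller.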
-
- Oct 31, 2002
-
-
Mark Grondona authored
blown away in v1.57.
o fix bug in _slurm_set_addr_char() which caused port and family to be updated even though gethostbyname() failed.
-
Moe Jette authored
(so the shutdown command can shut down both servers).
-
- Oct 29, 2002
-
-
Mark Grondona authored
o rewrite of slurmd.c -- extensive changes therein
o launch msg contains port numbers instead of slurm_addrs
-
- Oct 28, 2002
-
-
Moe Jette authored
a slurm_addr.
-
- Oct 24, 2002
-
-
Moe Jette authored
-
- Oct 18, 2002
-
-
Moe Jette authored
-
- Oct 15, 2002
-
-
Moe Jette authored
-
- Oct 07, 2002
-
-
Moe Jette authored
that includes a buffer pointer, size, and offset. The buffer is automatically expanded as space is needed.
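A structure holding a buffer pointer, size, and offset that grows on demand can be sketched as follows. The type and function names (struct grow_buf, grow_buf_append()) and the doubling policy are assumptions made for illustration and are not the actual SLURM pack buffer.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical growable buffer: storage, allocated size, and the
 * offset of the next byte to be written. */
struct grow_buf {
    char  *head;
    size_t size;
    size_t offset;
};

/* Append len bytes, enlarging the storage first when it is too small. */
static int grow_buf_append(struct grow_buf *b, const void *data, size_t len)
{
    assert(b != NULL && data != NULL && b->offset <= b->size);
    if (len > b->size - b->offset) {
        size_t need = b->offset + len;
        size_t cap = b->size ? b->size : 64;
        while (cap < need)
            cap *= 2;                  /* doubling is an assumed policy */
        char *p = realloc(b->head, cap);
        if (p == NULL)
            return -1;                 /* old storage remains valid */
        b->head = p;
        b->size = cap;
    }
    memcpy(b->head + b->offset, data, len);
    b->offset += len;
    return 0;
}
```

Callers start from a zeroed structure and never track the remaining space themselves, which is the convenience the commit message describes.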
-
- Sep 26, 2002
-
-
Moe Jette authored
_slurm_msg_sendto was not advancing the buffer pointer or decreasing the xmit size if the message could not be transmitted in a single send request. _slurm_msg_recvfrom_timeout was leaving data on the socket and not reporting any errors if the buffer was too small for the data presented.
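The send-side bug described above is the classic partial-write problem. A minimal sketch of the corrected pattern follows; send_all() is a hypothetical helper, not the body of _slurm_msg_sendto, and it omits the timeout handling the real function has.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <sys/socket.h>
#include <sys/types.h>

/* Hypothetical helper: keep calling send() until every byte is out.
 * After a short send, the pointer advances and the remaining length
 * shrinks, which is the step the original code was missing. */
static ssize_t send_all(int fd, const void *buf, size_t len)
{
    const char *p = buf;
    size_t left = len;

    assert(buf != NULL);
    while (left > 0) {
        ssize_t n = send(fd, p, left, 0);
        if (n < 0) {
            if (errno == EINTR)
                continue;          /* interrupted: retry the same span */
            return -1;             /* real error, errno is set */
        }
        p += n;                    /* advance past the bytes just sent */
        left -= (size_t)n;         /* shrink the remaining transmit size */
    }
    return (ssize_t)len;
}
```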
-