- Dec 20, 2013
-
-
Danny Auble authored
-
Danny Auble authored
runjob_mux.
-
Danny Auble authored
0c00d359
-
Danny Auble authored
for better debug
-
Danny Auble authored
midplane block that starts on a higher coordinate than it ends (i.e., if a block has midplanes [0010,0013], 0013 is the start even though it is listed second in the hostlist).
-
- Dec 19, 2013
-
-
Morris Jette authored
It has been changed to improve the calculated value for pending jobs and to use the actual node count for jobs that have been started (including suspended, completed, etc.). Bug 549.
-
- Dec 18, 2013
-
-
Danny Auble authored
that spans multiple midplanes, the cnodes that are in error are correctly accounted for.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
being in error.
-
Danny Auble authored
-
Morris Jette authored
Note that each job's node allocation is counted separately. Bug 548.
-
- Dec 17, 2013
-
-
Danny Auble authored
-
Danny Auble authored
will return ENOTCONN and not initialize the addr_str causing valgrind errors.
-
- Dec 16, 2013
-
-
Morris Jette authored
-
Morris Jette authored
-
Hughes, Doug authored
This allows multiple job IDs to be specified for hold, uhold, resume, suspend, release, etc.
-
Morris Jette authored
-
- Dec 14, 2013
-
-
Danny Auble authored
226b49a3
-
Morris Jette authored
The test would periodically fail due to expect timing. This seems to fix the problem.
-
Danny Auble authored
-
- Dec 13, 2013
-
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
Fix slurmstepd race condition when separate threads are reading and modifying the job's environment, which can result in slurmstepd failing with an invalid memory reference. Observed at shutdown when running the task epilog and reading the environment variable SLURM_STEP_KILLED_MSG_NODE_ID.
-
Morris Jette authored
We do not want to look at the core file, so avoid generating it and then having to manually clear it later.
-
- Dec 12, 2013
-
-
Morris Jette authored
Without this change, sstat would try to unpack accounting data that was never packed, resulting in message unpack errors.
-
Morris Jette authored
There were some parsing issues, and the test was not as general as it should have been.
-
Danny Auble authored
-
Danny Auble authored
throw away initialized variable.
-
Morris Jette authored
Without this patch, free() is called on a random memory location (i.e. whatever is on the stack), which can result in slurmstepd dying and a completed job not being purged in a timely fashion.
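The class of bug described above can be illustrated with a minimal sketch. This is not the actual Slurm patch; `copy_name` and its parameters are hypothetical names chosen for illustration. It assumes the usual C cleanup idiom, where a pointer initialized to NULL makes the shared `free()` call safe on every exit path.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper (not Slurm code): duplicate src into *out.
 * Returns 0 on success, -1 on failure. */
int copy_name(const char *src, char **out)
{
    char *tmp = NULL;   /* initialized, so the free() below is always safe */
    int rc = -1;

    if (src == NULL)
        goto cleanup;   /* early exit: tmp is still NULL here */

    tmp = malloc(strlen(src) + 1);
    if (tmp == NULL)
        goto cleanup;

    strcpy(tmp, src);
    *out = tmp;         /* ownership passes to the caller */
    tmp = NULL;         /* so cleanup does not free the caller's buffer */
    rc = 0;

cleanup:
    free(tmp);          /* free(NULL) is defined to do nothing */
    return rc;
}
```

If `tmp` were declared without an initializer, the early exit would pass whatever value happened to be on the stack to `free()`, which is the failure mode the commit message describes.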
-
- Dec 11, 2013
-
-
Morris Jette authored
-
Danny Auble authored
-
Morris Jette authored
Fix race condition in authentication credential creation that could corrupt memory. (NOTE: This race condition has existed since 2003 and would be exceedingly rare.)
-
- Dec 10, 2013
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
- Dec 09, 2013
-
-
Morris Jette authored
This is needed for job arrays with discontiguous task ID values (e.g. "123_[1,3,5,...99999]")
-
Morris Jette authored
Previously, job arrays were only listed with their native job ID (e.g. 123_0 listed as 123, 123_1 as 124, etc.). Now the job ID is listed using both formats (e.g. "123_1 (124)"). The same format is used for job step IDs (e.g. "123_1.2 (124.2)").
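The combined ID format described above can be sketched as follows. The function names and parameters are hypothetical and are not taken from the Slurm source; only the output layout ("123_1 (124)" and "123_1.2 (124.2)") comes from the commit message.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical helper: write "ARRAYJOB_TASK (JOBID)" into buf.
 * Returns the snprintf() result (length that was or would be written). */
int format_array_job_id(char *buf, size_t len, unsigned array_job_id,
                        unsigned task_id, unsigned job_id)
{
    assert(buf != NULL && len > 0);
    return snprintf(buf, len, "%u_%u (%u)", array_job_id, task_id, job_id);
}

/* Hypothetical helper: write "ARRAYJOB_TASK.STEP (JOBID.STEP)" into buf. */
int format_array_step_id(char *buf, size_t len, unsigned array_job_id,
                         unsigned task_id, unsigned job_id, unsigned step_id)
{
    assert(buf != NULL && len > 0);
    return snprintf(buf, len, "%u_%u.%u (%u.%u)",
                    array_job_id, task_id, step_id, job_id, step_id);
}
```

Calling `format_array_job_id(buf, sizeof(buf), 123, 1, 124)` fills `buf` with the first example string from the commit message, and `format_array_step_id(buf, sizeof(buf), 123, 1, 124, 2)` with the second.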
-
- Dec 08, 2013
-
-
jette authored
-