- Sep 24, 2002
-
-
Moe Jette authored
to the job).
-
Moe Jette authored
improved failure recovery.
-
Moe Jette authored
-
Moe Jette authored
(e.g. "default"=="DEFAULT", "Yes"=="YES", etc.)
-
Moe Jette authored
-
Moe Jette authored
Nodes actually become usable when slurmd gets started or slurmctld pings the node and it responds.
-
Moe Jette authored
-
Moe Jette authored
Remove non-agent based credential revoke. Minor clean-up of code to handle non-responsive node.
-
Moe Jette authored
state change to NOT_RESPOND or DOWN.
-
- Sep 23, 2002
-
-
Moe Jette authored
slurmctld now will ping slurmd periodically and flag non-responsive nodes as such. slurmd now responds to ping RPC.
-
Moe Jette authored
slurmctld now will ping slurmd periodically and flag non-responsive nodes as such.
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
Expanded error recovery logic on slurmd registration. slurmd to register active job_id plus step_id on initial registration.
-
Moe Jette authored
Expanded error recovery logic on slurmd registration.
-
Moe Jette authored
-
- Sep 21, 2002
-
-
Mark Grondona authored
still not perfect
-
Moe Jette authored
Terminate job if slurmd restart without it.
-
Moe Jette authored
-
- Sep 20, 2002
-
-
Moe Jette authored
-
Moe Jette authored
to slurmctld on restart.
-
Mark Grondona authored
-
Moe Jette authored
Get default TmpFS from #define instead of hard-wire to "/tmp".
-
Moe Jette authored
-
Mark Grondona authored
-
Mark Grondona authored
-
Mark Grondona authored
more than 1 process per node (and generating random hw contexts)
-
Moe Jette authored
-
Moe Jette authored
-
- Sep 19, 2002
-
-
Moe Jette authored
-
Moe Jette authored
-
Mark Grondona authored
still needs work -- should we continue to use the socket or close it?
-
Moe Jette authored
enhanced debugging added for agent, now passing node name along with address.
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
-
- Sep 18, 2002
-
-
Mark Grondona authored
o close fd on POLLERR in poll() loop
-
Moe Jette authored
DejaGnu directory files).
-