- Oct 11, 2003
-
-
Mark Grondona authored
- changed defs of HAVE_LIBELAN3 to HAVE_ELAN
-
Moe Jette authored
Send multiple SIGALRMs if needed and deal with possible abort of a thread.
-
- Oct 10, 2003
- Oct 08, 2003
- Oct 03, 2003
- Oct 01, 2003
-
-
Moe Jette authored
-
- Sep 30, 2003
- Sep 29, 2003
-
-
Moe Jette authored
let that job be scheduled even if there are jobs in COMPLETING state.
-
Moe Jette authored
Fatal() no longer calls abort(), but terminates job using exit(1).
-
Moe Jette authored
(state is completing and end time is within KillWait + 2 seconds).
-
Moe Jette authored
save on job_step completion. This should reduce the likelihood of information being lost on a slurmctld restart.
-
Moe Jette authored
-
Moe Jette authored
-
- Sep 25, 2003
-
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
ranged string rather than one message per node (too verbose).
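The idea of collapsing per-node messages into one ranged string can be sketched as follows. This is only an illustration, not the slurmctld code: the helper name `format_ranges` and the choice of plain integer node numbers are assumptions, and the start of the entry above is not available to confirm the exact message involved.

```c
#include <stddef.h>
#include <stdio.h>

/*
 * Hypothetical helper (not from the SLURM source): write a sorted list of
 * node numbers as a comma-separated ranged string, e.g. {1,2,3,7} -> "1-3,7".
 * Returns 0 on success, -1 if the buffer is too small (buf may then hold a
 * partial string).
 */
int format_ranges(const int *ids, size_t n, char *buf, size_t len)
{
    size_t used = 0;
    size_t i = 0;

    if (len == 0)
        return -1;
    buf[0] = '\0';

    while (i < n) {
        size_t j = i;
        int w;

        /* Extend j to the end of the run of consecutive numbers. */
        while (j + 1 < n && ids[j + 1] == ids[j] + 1)
            j++;

        if (j > i)
            w = snprintf(buf + used, len - used, "%s%d-%d",
                         used ? "," : "", ids[i], ids[j]);
        else
            w = snprintf(buf + used, len - used, "%s%d",
                         used ? "," : "", ids[i]);

        if (w < 0 || (size_t)w >= len - used)
            return -1;
        used += (size_t)w;
        i = j + 1;
    }
    return 0;
}
```

With such a helper, a single log line can name every affected node instead of emitting one line per node.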
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
node begins to respond again.
-
Moe Jette authored
lowest active job priority (for working down as new jobs are initiated). This bug only applies when slurmctld restarts and there are jobs in system hold state (priority 1).
-
Moe Jette authored
-
Moe Jette authored
This is a problem on PVC, where nodes are regularly DRAINED.
-
- Sep 23, 2003
-
-
Moe Jette authored
-
Moe Jette authored
-
Moe Jette authored
scalability. An arbitrary number of requests may be queued. They are processed one per second until the queue is empty or every pending request was attempted too recently to retry (the configuration sets a minimum retry interval of 60 seconds).
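A minimal sketch of the queueing behaviour described in this entry, assuming a simple FIFO list. The type and function names (`retry_req`, `retry_queue`, `retry_enqueue`, `retry_dequeue_due`) are hypothetical and not taken from slurmctld; the entry does not show which kind of request is queued, so the record carries only an illustrative id.

```c
#include <stddef.h>
#include <time.h>

#define MIN_RETRY_INTERVAL 60   /* seconds, the value quoted in the entry */

/* Hypothetical request record; field names are illustrative only. */
struct retry_req {
    int id;
    time_t last_attempt;
    struct retry_req *next;
};

struct retry_queue {
    struct retry_req *head;
    struct retry_req *tail;
};

/* Append a request whose most recent attempt happened at time `now`. */
void retry_enqueue(struct retry_queue *q, struct retry_req *r, time_t now)
{
    r->last_attempt = now;
    r->next = NULL;
    if (q->tail)
        q->tail->next = r;
    else
        q->head = r;
    q->tail = r;
}

/*
 * Intended to be called about once per second. Returns the oldest request
 * if at least MIN_RETRY_INTERVAL seconds have passed since its last
 * attempt, otherwise NULL. Because requests are appended in attempt order,
 * a head that is not yet due means nothing behind it is due either.
 */
struct retry_req *retry_dequeue_due(struct retry_queue *q, time_t now)
{
    struct retry_req *r = q->head;

    if (r == NULL)
        return NULL;
    if (difftime(now, r->last_attempt) < MIN_RETRY_INTERVAL)
        return NULL;

    q->head = r->next;
    if (q->head == NULL)
        q->tail = NULL;
    r->next = NULL;
    return r;
}
```

Returning at most one request per call keeps the rate at one per second when the caller runs on a one-second timer. A request that fails again would be passed back to `retry_enqueue` with the current time.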
-
Moe Jette authored
These jobs are reported by slurmd on node registration. They are logged but otherwise ignored by slurmctld. Several slurmd logging messages were changed to report job id and step id using %u format rather than %d format (which shows no-allocate job id values as negative numbers).
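The effect of the format change can be illustrated with a small sketch. The helper `format_job_id` and the sample id value are assumptions for illustration only; the sketch assumes a 32-bit `int` and the usual two's-complement conversion.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/*
 * Hypothetical helper (not from slurmd): format the same 32-bit job id
 * once as a signed value (%d) and once as an unsigned value (%u).
 */
void format_job_id(uint32_t job_id, char *as_signed, char *as_unsigned,
                   size_t len)
{
    /* Ids with the high bit set come out negative when treated as int. */
    snprintf(as_signed, len, "%d", (int)job_id);
    /* %u prints the id as the unsigned value it actually is. */
    snprintf(as_unsigned, len, "%u", (unsigned int)job_id);
}
```

For an illustrative id of 0xffff0001, the signed form reads -65535 while the unsigned form reads 4294901761, which is why the %u form is easier to match against other log entries.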
-
- Sep 21, 2003
-
-
Moe Jette authored
that reach the slurm inactivity time limit.
-
Moe Jette authored
control (it needs to complete all pending RPCs and save state before the primary reads state and takes over).
-
Moe Jette authored
responsibilities (backup was routinely reading at startup).
-
Moe Jette authored
server to shutdown request.
-
Moe Jette authored
SIGPWR, and SIGLOST.
-
Moe Jette authored
and when returned to service improperly went back into the DRAINING state (job counter was inconsistent).
-
Moe Jette authored
transition to DRAINED state.
-
- Sep 20, 2003