- Oct 08, 2004
-
-
Moe Jette authored
and FastSchedule=0 and the cluster is heterogeneous.
-
- Sep 24, 2004
-
-
Moe Jette authored
Move pretty much all BGL-specific logic into that module and the associated plugin and make use of an opaque data object for maintaining the information.
-
- Sep 21, 2004
-
-
Moe Jette authored
rejected instead of sticking around in a pending state.
-
- Sep 16, 2004
-
-
Moe Jette authored
conn_type, and rotate). slurmctld to set min_nodes for job based upon geometry (if supplied). This avoid trying to schedule jobs when insufficient resources are available.
-
- Aug 27, 2004
-
-
Moe Jette authored
-
- Aug 25, 2004
-
-
Moe Jette authored
code for current Linux clusters and develop all new logic for Blue Gene).
-
- Aug 18, 2004
- Aug 10, 2004
-
-
Moe Jette authored
-
- Aug 06, 2004
-
-
Moe Jette authored
-
- Jul 09, 2004
-
-
Moe Jette authored
with "Shared=yes" configuration failed to function as desired. (gnats:459).
-
- Jun 05, 2004
-
-
Moe Jette authored
with FastSchedule==0 configuration.
-
- May 17, 2004
-
-
Moe Jette authored
-
- May 14, 2004
- May 11, 2004
-
-
Moe Jette authored
time or node limits.
-
- Mar 08, 2004
-
-
Moe Jette authored
(which exhaust available memory). Restructure slurmctld agent logic for higher throughput.
-
- Mar 03, 2004
-
-
Moe Jette authored
(gnats:381)
-
- Feb 20, 2004
-
-
Moe Jette authored
them when partition limits change.
-
- Jan 26, 2004
-
-
Moe Jette authored
clusters with FastSchedule configured off * Only return DOWN nodes to service if the reason for them being in that state is non-responsiveness and ReturnToService configured on * Some general code clean-up
-
- Dec 11, 2003
-
-
Moe Jette authored
sharing via node record of job count (0 | 1) and bitmap of nodes which permit sharing. Previous logic could permit a job accepting shared nodes to be scheduled on a node that already had a running job not accepting shared nodes.
-
- Dec 05, 2003
-
-
Moe Jette authored
gracefully kill all jobs allocated resources on those nodes, gracefully kill all pending jobs that require those nodes, leave pending jobs that exclude those nodes but ignore those nodes. Added "best_effort" argument to node_name2bitmap() function. Fix potential memory leak when maui scheduler interface resets the required nodes. (gnats:342)
-
- Nov 18, 2003
-
-
Moe Jette authored
attempt failure (i.e. convert to event driven instead of just polling). It does still have a 2-second timer to keep the overhead reasonable.
-
Moe Jette authored
be changed whenever a node responded to a "ping" and other insignificant events, which resulted in the backfill scheduling running more frequently than required.
-
- Nov 07, 2003
-
-
Moe Jette authored
purge the request and job if/when the node changes to state DOWN.
-
- Nov 06, 2003
-
-
Moe Jette authored
using node hostlist format.
-
- Oct 30, 2003
-
-
Moe Jette authored
-
- Oct 14, 2003
-
-
Moe Jette authored
Job's priority would be reset if it could not be immediately scheduled AND if the best-fit node selection picked a DOWN node.
-
- Oct 03, 2003
-
-
Moe Jette authored
lost of an EPILOG_COMPLETE message.
-
- Sep 30, 2003
-
-
Moe Jette authored
-
- Sep 29, 2003
-
-
Moe Jette authored
Fatal() no longer calls abort(), but terminates job using exit(1).
-
- Sep 25, 2003
- Sep 21, 2003
-
-
Moe Jette authored
transition to DRAINED state.
-
- Sep 17, 2003
- Sep 04, 2003
-
-
Moe Jette authored
resources to satisfy the request).
-
- Aug 12, 2003
-
-
Moe Jette authored
-
- Aug 04, 2003
-
-
Moe Jette authored
batch_job_launch RPC, then deallocate those resources and requeue the job. If a node registers and fails to show a batch job that should have a script running there (node zero of allocation), then consider the job complete.
-
- Jul 25, 2003
-
-
Moe Jette authored
-