- Sep 27, 2011
-
-
Morris Jette authored
Note that a job's size factor in setting priority is based upon node or CPU count.
-
Morris Jette authored
Add the ability to read the slurm.conf file across a network using the accounting_storage plugin. The accounting_storage/slurmdbd plugin does require modification for this functionality to fully operate. Patch from Andriy Grytsenko (Massive Solutions Limited).
-
Morris Jette authored
-
Mark A. Grondona authored
The slurmctld code that processes job notify messages unecessarily restricts these messages to be from the slurm user or root. This patch allows users to send notifications to their own jobs.
-
Mark A. Grondona authored
The slurmctld code that processes job notify messages unecessarily restricts these messages to be from the slurm user or root. This patch allows users to send notifications to their own jobs.
-
Morris Jette authored
Interpret a reservation with Nodes=ALL and a Partition specification as reserving all nodes within the specified partition rather than all nodes on the system. Based upon patch by Phil Eckert (LLNL).
-
Morris Jette authored
Change the event trigger data structure for slightly improved performance.
-
Morris Jette authored
An a flag record to event triggers and add support for a flag value of "PERM" for permanent triggers, triggers which are only removed when the slurmctld daemon is cold-started or the trigger is explicitly removed.
-
- Sep 26, 2011
-
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Morris Jette authored
-
Morris Jette authored
Many cosmetic modifications to eliminate warning message from GCC version 4.6 compiler, mostly due to unused variables.
-
- Sep 21, 2011
-
-
Danny Auble authored
without putting a block in error state.
-
- Sep 20, 2011
-
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Permit administrator to change a job's QOS to any value without validating the job's owner has permission to use that QOS. Based upon patch by Phil Eckert (LLNL).
-
Morris Jette authored
-
Morris Jette authored
Update tests as appropriate for change in output of many commands. BP_LIST or BP_List change to MIDPLANELIST or MidplaneList.
-
Morris Jette authored
Modify advance reservation to accept multiple specific block sizes rather than a single node count. This is very important for BlueGene systems.
-
Morris Jette authored
Modify srun's SIGINT handling logic timer (two SIGINTs within one second) to be based microsecond rather than second timer.
-
- Sep 19, 2011
-
-
Danny Auble authored
midplane that has that cable as it's 'out' cable. Since the node isn't really down there isn't really a better place to mark a note about it.
-
Danny Auble authored
structure.
-
Danny Auble authored
-
- Sep 17, 2011
-
-
Danny Auble authored
-
Danny Auble authored
jobs happen to be running on blocks not in the new config.
-
- Sep 16, 2011
-
-
Morris Jette authored
salloc/mpirun does not play well together with task affinity socket binding. The following example illustrates the problem. [sulu] (slurm) mnp> salloc -p bones-only -N1-1 -n3 --cpu_bind=socket mpirun cat /proc/self/status | grep Cpus_allowed_list salloc: Granted job allocation 387 -------------------------------------------------------------------------- An invalid physical processor id was returned ... The problem is that with mpirun jobs Slurm launches only a single task, regardless of the value of -n. This confuses the socket binding logic in task affinity. The result is that task affinity binds the task to only a single cpu, instead of all the allocated cpus on the socket. When mpi attempts to bind to any of the other allocated cpus on the socket, it gets the "invalid physical processor id" error. Note that the problem may occur even if socket binding is not explicitly requested by the user. If task/affinity is configured and the allocated CPUs are a whole number of sockets, Slurm will use "implicit auto binding" to sockets, triggering the problem. Patch from Martin Perry (Bull).
-
Danny Auble authored
Conflicts: NEWS
-
Danny Auble authored
-
Danny Auble authored
block from the database.
-
Morris Jette authored
Update reservation web page to describe mechanism to reserve CPUs rather than whole nodes and provide an example.
-
Danny Auble authored
before we do.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
-
- Sep 15, 2011
-
-
Morris Jette authored
Avoid clearing a job's reason from JobHeldAdmin or JobHeldUser when it is otherwise updated using scontrol or sview commands. Patch based upon work by Phil Eckert (LLNL).
-
Morris Jette authored
Do not remove the backup slurmctld's pid file when it assumes control, only when it actually shuts down. Patch from Andriy Grytsenko (Massive Solutions Limited).
-