- Mar 16, 2012
-
-
Danny Auble authored
-
- Mar 14, 2012
-
-
Morris Jette authored
Cray - For srun wrapper when creating a job allocation, set the default job name to the executable file's name. Ignore leading directory names in the path.
-
- Mar 13, 2012
-
-
Morris Jette authored
permit the srun and salloc commands to be executed in the background on Cray systems
-
Morris Jette authored
Add new job state reason of "FrontEndDown" which applies only to Cray and IBM BlueGene systems.
-
Danny Auble authored
-
- Mar 12, 2012
-
-
Danny Auble authored
the queue when trying to place a larger than midplane job.
-
- Mar 02, 2012
-
-
Morris Jette authored
In cray/srun wrapper, only include aprun "-q" option when srun "--quiet" option is used.
-
- Feb 29, 2012
-
-
Morris Jette authored
-
- Feb 28, 2012
-
-
Morris Jette authored
-
- Feb 24, 2012
-
-
Morris Jette authored
-
Morris Jette authored
-
Danny Auble authored
-
Morris Jette authored
-
- Feb 23, 2012
-
-
Danny Auble authored
-
- Feb 20, 2012
-
-
jette authored
Patch from Aleksej Saushev.
-
- Feb 06, 2012
-
-
Danny Auble authored
is a convenience function in BSD and glibc that internally calls the equivalent of int masterfd = open("/dev/ptmx", flags); grantpt (masterfd); unlockpt (masterfd); int slavefd = open (slave, O_RDRW|O_NOCTTY); (in psuedocode) On Linux, with some combinations of glibc/kernel (in this case glibc-2.14/Linux-3.1), the equivalent of grantpt(3) was failing in slurmstepd with EPERM, because the allocated pty was getting root ownership instead of the user running the slurm job. From the POSIX description of grantpt: "The grantpt() function shall change the mode and ownership of the slave pseudo-terminal device... The user ID of the slave shall be set to the real UID of the calling process..." http://pubs.opengroup.org/onlinepubs/007904875/functions/grantpt.html This means that for POSIX-compliance, the real user id of slurmstepd must be the user executing the SLURM job at the time openpty(3) is called. Unfortunately, the real user id of slurmstepd at this point is still root, and only the effective uid is set to the user. This patch is a work-around that uses the (non-portable) setresuid(2) system call to reset the real and effective uids of the slurmstepd process to the job user, but keep the saved uid of root. Then after the openpty(3) call, the previous credentials are reestablished using the same call.
-
- Feb 03, 2012
-
-
Morris Jette authored
Fix for srun allocating running within existing allocation with --exclude option and --nnodes count small enough to remove more nodes. > salloc -N 8 salloc: Granted job allocation 1000008 > srun -N 2 -n 2 --exclude=tux3 hostname srun: error: Unable to create job step: Requested node configuration is not available Patch from Phil Eckert, LLNL.
-
- Feb 02, 2012
-
-
Morris Jette authored
Fix bug in step task distribution when nodes are not configured in numeric order. Patch from Hongjia Cao, NUDT.
-
- Feb 01, 2012
-
-
Morris Jette authored
Fix bug when requeued batch job is scheduled to run on a different node zero, but attemts job launch on old node zero causing fatal error "Invalid host_index -1 for job #"
-
Morris Jette authored
Avoid slurmctld abort due to bad pointer when setting an advanced reservation MAINT flag if it contains no nodes (only licenses).
-
- Jan 31, 2012
-
-
Danny Auble authored
blocks are in an error state.
-
Morris Jette authored
-
Danny Auble authored
to give a correct priority on the first decay cycle after a restart of the slurmctld. Patch from Martin Perry, Bull.
-
- Jan 27, 2012
-
-
Danny Auble authored
Lucero Palau.
-
Morris Jette authored
This patch was previously applied to SLURM v2.4 and is being back-ported due to problems being reported in SLURM v2.3. Original commit is here https://github.com/SchedMD/slurm/commit/4c0eea7b8c20ccb1cacad51838a1ea8257cc637d
-
- Jan 25, 2012
-
-
Morris Jette authored
Set DEFAULT flag in partition structure when slurmctld reads the configuration file. Patch from Rémi Palancher. Note the flag is set when the information is sent via RPC for sinfo.
-
- Jan 24, 2012
-
-
Morris Jette authored
-
- Jan 20, 2012
-
-
Morris Jette authored
Fix for possible invalid memory reference in slurmctld in job dependency logic. Patch from Carles Fenoy (Barcelona Supercomputer Center).
-
- Jan 19, 2012
-
-
Danny Auble authored
all jobs would be returned even if the flag was set. Patch from Bill Brophy, Bull.
-
- Jan 18, 2012
-
-
Morris Jette authored
Fix bug in --switch option with topology resulting in bad switch count use. Patch from Alejandro Lucero Palau (Barcelona Supercomputer Center).
-
- Jan 13, 2012
-
-
Danny Auble authored
number.
-
Morris Jette authored
-
Morris Jette authored
Let operators see reservation data even if "PrivateData=reservations" flag is set in slurm.conf. Patch from Don Albert, Bull.
-
- Jan 09, 2012
-
-
Morris Jette authored
Fix bug in srun --multi-prog configuration file to avoid printing duplicate record error when "*" is used at the end of the file for the task ID. It means all task IDs not otherwise identified.
-
Morris Jette authored
Fix race condition where sbcast command can result in deadlock of slurmd daemon. Patch by Don Albert, Bull.
-
- Dec 28, 2011
-
-
Morris Jette authored
-
- Dec 21, 2011
-
-
Morris Jette authored
-
- Dec 19, 2011
-
-
Morris Jette authored
-
- Dec 17, 2011
-
-
Morris Jette authored
-
- Dec 15, 2011
-
-
Morris Jette authored
Prevent resetting a held job's priority when updating other job parameters. Patch from Alejandro Lucero Palau, BSC.
-