Commits · 25234cd286f970146bb9bec625920243f062f05a · tud-zih-energy / Slurm

Jun 19, 2013
- Fix memory leak in step when destroying exec_wait_info · 25234cd2
  Danny Auble authored 11 years ago
  
  25234cd2
Jun 18, 2013
- PROFILING - Make sure polling threads end correctly · a8f56cb2
  Danny Auble authored 11 years ago
  
  a8f56cb2
Jun 05, 2013
- energy - only zero out the previous value if new consumption is reported. · 8d99170f
  Danny Auble authored 11 years ago
  
  8d99170f
- energy - On a single node only use the last task for gathering energy. · 09601d60
  Danny Auble authored 11 years ago
  
  Since we don't currently track energy usage per task (only per step). Otherwise we get double the energy.
  09601d60
May 24, 2013
- Infrastructure for setting up the polling thread. Not working until · 46c5a011
  Danny Auble authored 11 years ago
  
  acctg-freq is a string instead of a uint16_t
  46c5a011
May 23, 2013
- Remove infiniband polling thread. · 79add61c
  Yiannis Georgiou authored 11 years ago
  
  79add61c
May 21, 2013
- INIFINIBAND - run fini · c55b4de3
  Danny Auble authored 11 years ago
  
  c55b4de3
- INFINIBAND - change acct_gather_infiniband_g_update_node to · 19e90ed6
  Danny Auble authored 11 years ago
  
  acct_gather_infiniband_g_node_init
  19e90ed6
- INFINIBAND - Initial patch to gather infiniband stats. · 867c5615
  Yiannis Georgiou authored 11 years ago
  
  867c5615
May 15, 2013
- Add global slurmd_job_t to hdf5 and move where the profile functions · b6cfff6d
  Danny Auble authored 11 years ago
  
  are called.
  b6cfff6d
May 10, 2013
- complete rewrite of hdf5 code. There is most likely more to be done. · 692f427b
  Danny Auble authored 11 years ago
  
  692f427b
- initial check-in for hdf5 profiling · b7f75ccb
  Rod Schultz authored 11 years ago
  
  b7f75ccb
Apr 24, 2013
- changed http://www.schedmd.com/slurmdocs/ to http://slurm.schedmd.com/ · ee7bca79
  Danny Auble authored 11 years ago
  
  ee7bca79
Jan 29, 2013
- No change in logic. Chnage formatting to match linux kernel standard · c10d7f6e
  Morris Jette authored 12 years ago
  
  c10d7f6e
Nov 26, 2012

Add timeout on srun's I/O connect message to better handle some failure modes · 8405b4eb

Morris Jette authored 12 years ago

If the slurmstepd connects task I/O, but aborts after srun accepts the connect
and before slurmstepd writes data then srun could possibly hand indefinitely.
This probably does not explain failures seen at CEA, but can't hurt matters.
then the sr

8405b4eb

Nov 21, 2012

slurmstepd : correct a bug in the IO thread termination monitoring · f297242e

Matthieu Hautreux authored 12 years ago

A dedicated thread (_kill_thr) is launched by slurmstepd at the end of a
step in order to destroy the IO thread if it does not manage to correctly
terminate by itself after 300 seconds.

Two bugs are corrected in this logic by this patch.

First, the performed sleep(300) is not protected against interruptions
and this delay can be reduced to a few seconds in case of signals received
by slurmstepd, thus, reducing the delay and forcing the IO thread to
terminate before the expiration of the grace time. The logic is modified
to ensure that the delay is respected using a loop around the sleep().

Second, to terminate the IO thread, a SIGKILL is delivered to the IO thread
using pthread_kill. However, sending SIGKILL using pthread_kill is a
process-wide operation (see man pthread_kill), thus all the slurmstepd
threads are killed and slurmstepd is terminated. This logic is modified
by using pthread_cancel() instead of pthread_kill() thus letting the
pthread_join() of _wait_for_io() having a chance to act as expected.

Without this patch, when _kill_thr is interrupted, slurmstepd is
terminated, letting the step in a incomplete state, as the node may not
have been able to send the REQUEST_STEP_COMPLETE to the controler.
Thus, consecutive steps can no longer be executed and stay permanently in
the "Job step creation temporarily disabled, retrying" state.

f297242e

Nov 07, 2012

Modify default log timestamp pto conform to RFC 5424 format · 4b941731

Janne Blomqvist authored 12 years ago

the attached patch changes the default timestamp format in logfiles to conform to RFC 5424 (the current version of the syslog RFC). It is identical to the current default "ISO 8601" timestamp used by slurm, with the exception that the timezone offset is appended. This has the benefits of

1) It's unambiguous.

2) Avoids potential confusion for admins running cluster(s) in different timezones.

3) Might help debug issues related to DST transitions. (More on that later..)

(To be pedantic, a RFC 5424 timestamp is still a valid ISO 8601 timestamp, but the converse is not necessarily true. So there is RFC 3339 which is a "profile" of ISO 8601, that is a subset, recommended for internet protocols. The RFC 5424 timestamp, in turn, is a subset of the RFC 3339 timestamps.)

The previous behavior of can be used by running configure with the

--disable-rfc5424time

flag.

4b941731

Oct 22, 2012

Add task_pre_launch_priv call in task plugin API · 66e80a49

Matthieu Hautreux authored 12 years ago

This privileged call is executed just after the fork of each
forked task by user root before becoming the user. It enables
the task plugin to perform actions as a privileged user in
every task context.

66e80a49

Oct 16, 2012
- For accounting, correct data types in read of /proc · 24f0cd53
  Morris Jette authored 12 years ago
  
  24f0cd53
Oct 15, 2012

Purely cosmetic modifications for new energy consumption code · f9fdd65b
Morris Jette authored 12 years ago

f9fdd65b

Energy accounting patch enhancements · 831c5ac4

yiannis georgiou authored 12 years ago

provides the following improvements on the energy accounting framework :
1)the per step average frequency is now calculated correctly and may be reported by sstat and sacct
2)correction on the logic of per step energy consumption calculation through rapl plugin. This value may be also reported through sstat and sacct
3)node power and energy monitoring now working correctly through rapl plugin.

831c5ac4

Jul 16, 2012
- Add support for user setting of cpu frequency for a job step · ac3fb2bf
  Don Albert authored 12 years ago
  
  ac3fb2bf
Jul 13, 2012

slurmstepd: don't call exec if task fails to get notification from parent · 9006dda4

Mark A. Grondona authored 12 years ago

If exec_wait_child_wait_for_parent() fails for any reason, it is safer
to abort immediately rather than proceed to execute the user's job.

9006dda4

slurmstepd: Kill remaining children if fork fails · 5b8dba9e

Mark A. Grondona authored 12 years ago

On a failure of fork(2), slurmstepd would print an error and exit,
possibly leaving previously forked children waiting.

Ensure a better cleanup by killing all active children on fork failure
before exiting slurmstepd.

5b8dba9e

slurmstepd: Close childfd of exec_wait_info in parent · eca089e3

Mark A. Grondona authored 12 years ago

Close the read end of the pipe slurmstepd uses to notify children
it is time to call exec(2) in order to save one file descriptor per
task. (Previously, the read side of the pipe wasn't closed until
exec_wait_info was destroyed)

eca089e3

May 29, 2012
- Changed jobacct_gather plugin infrastructure to be cleaner and easier to · 6d015792
  Danny Auble authored 12 years ago
  
  maintain.
  6d015792
May 11, 2012
- Patch to add experimental jobacct_gather/cgroup plugin. · 468326c4
  Martin Perry authored 12 years ago
  
  Original patch from Martin Perry (Bull)
  468326c4
May 07, 2012
- Add user_id to batch complete RPC · 35be1b13
  Morris Jette authored 12 years ago
  
  35be1b13
May 05, 2012
- Route REQUEST_COMPLETE_BATCH_SCRIPT RPC from slurmstepd through slurmd · fa7c0779
  Morris Jette authored 12 years ago
  
  This will eventually permit the slurmctld to respond to the slurmd with a new task launch RPC
  fa7c0779
Apr 27, 2012

Cray - Add support for batch job with zero compute nodes · cd6fb7e5

Morris Jette authored 12 years ago

Cray - Add support for zero compute note resource allocation to run batch
script on front-end node with no ALPS reservation. Useful for pre- or post-
processing. NOTE: The partition must be configured with MinNodes=0.

cd6fb7e5

Mar 26, 2012
- More work on NRT load call · d6755101
  Morris Jette authored 13 years ago
  
  Add job name to call. Add logging for call.
  d6755101
Mar 22, 2012

Cosmetic mods for better clarity · a3fe59db
Morris Jette authored 13 years ago

a3fe59db

Rearrange job manager's content to better handle secured FS · 7b55fb81

Matthieu Hautreux authored 13 years ago

Access to secured FS often requires to have a valid token in the
user context. With SLURM, this token can be obtained using one of
the possible pluggable architecture, SPANK or PAM.

IO setup of SLURM can require to access secured FS (stdout/stderr
files). This patch ensures that pluggable frameworks are activated
and called prior to IO setup and that IO are terminated before
calling pluggable framework exit calls.

7b55fb81

slurmstepd/job_manager(): move PR_SET_DUMPABLE set to the begining · b2917cf5

Matthieu Hautreux authored 13 years ago

set PR_DUMPABLE as soon as possible, especially before any plugins
are loaded. This will allow someone debugging to get a coredump.

b2917cf5

slurmstepd/_fork_all_tasks: error handling cleanup · d589ea7d

Matthieu Hautreux authored 13 years ago

To prepare io_setup integration in _fork_all_tasks, error handling
must be transformed to not always return SLURM_ERROR but be prepared
to return SLURM_SUCCESS in case of an io_setup error.

d589ea7d

Feb 04, 2012

Add call to MPI plugin · 231c927c

Morris Jette authored 13 years ago

Add call to mpi_hook_slurmstepd_prefork() from slurmstep
immediately prior to fork/exec of user tasks.
Patch from Hongjia Cao, NUDT.

231c927c

Jan 19, 2012
- Minor improvement in slurmstepd signal handling · 08fce8ee
  Morris Jette authored 13 years ago
  
  08fce8ee
Jan 18, 2012
- Correction to signal handling logic in slurmstepd · bd72f4ee
  Morris Jette authored 13 years ago
  
  bd72f4ee
Jan 17, 2012
- Minor mods for signal unblocking mods · 7b3f0eda
  Morris Jette authored 13 years ago
  
  7b3f0eda
Jan 13, 2012

slurmstepd: unblock all signals before invoking user job · 06047590

Mark A. Grondona authored 13 years ago

It was found that slurmstepd was intermittently leaving SIGPIPE
blocked when launching user tasks. This may have something to do
with the fact that the xsignal_unblock() call in _fork_all_tasks()
is referencing an extern array (nominally this should have unblocked
SIGPIPE), but I didn't spend the time to fully track this issue
down. Instead, I figured there is probably no reason we would _not_
want to unblock *all* signals, so this patch does that.

Before this change, the following program fails every once in awhile:

 #include <stdio.h>
 #include <signal.h>

 int main (int ac, char **av)
 {
	int i, rc = 0;
	struct sigaction act;
	for (i = 1; i < SIGRTMAX; i++) {
		sigaction (i, NULL, &act);
		if (act.sa_handler == SIG_DFL)
			continue;
		fprintf (stderr, "Signal %d appears to be ignored!\n", i);
		rc = 1;
	}
	return (rc);
 }

with:

 srun -N1 -n1 ./test
 Signal 13 appears to be ignored!

after the change, the program succeeds.

06047590