- Mar 22, 2005
-
-
Moe Jette authored
external scheduler control of RoolOnly partition (Chris Holmes, HP).
-
- Mar 04, 2005
-
-
Moe Jette authored
of communication protocol over IBM Federation switch. -- Add new slurm.conf parameter ProctrackType (process tracking) and associated plugin in the slurmd module.
-
- Feb 01, 2005
-
-
Moe Jette authored
Change slurm_job_will_run() to not return a message, just an error code.
-
- Jan 27, 2005
-
-
Moe Jette authored
job steps on front-end system (e.g. Blue Gene).
-
- Nov 30, 2004
-
-
Moe Jette authored
-
- Oct 20, 2004
-
-
Moe Jette authored
All work from <takao.hatazaki@hp.com>.
-
- Oct 16, 2004
-
-
Moe Jette authored
-
- Oct 12, 2004
-
-
Moe Jette authored
tests were change appropriately. This lets one fake a Linux cluster without all of the BGL-specific logic.
-
- Sep 29, 2004
-
-
Moe Jette authored
-
- Sep 24, 2004
-
-
Moe Jette authored
Move pretty much all BGL-specific logic into that module and the associated plugin and make use of an opaque data object for maintaining the information.
-
- Sep 20, 2004
-
-
Moe Jette authored
reported as desired. Add new function to drain node (for use by select/bluegene node monitoring thread).
-
- Sep 17, 2004
-
-
Moe Jette authored
-
- Sep 15, 2004
-
-
phung4 authored
-
- Aug 25, 2004
-
-
Moe Jette authored
code for current Linux clusters and develop all new logic for Blue Gene).
-
- Aug 23, 2004
-
-
Moe Jette authored
-
- Aug 19, 2004
-
-
Moe Jette authored
overwhelm the one slurmd for the entire system).
-
- Aug 17, 2004
-
-
Moe Jette authored
Map all nodes in cluster to a single front-end node. Don't repeat ping/register/kill/etc. RPCs to all pseudo nodes, just the front-end. Treat single message for some RPCs as representing all nodes in the cluster: register, ping responce, epilog complete, etc.
-
- Aug 04, 2004
-
-
Moe Jette authored
-
- Jul 26, 2004
-
-
Moe Jette authored
-
- Jul 23, 2004
-
-
Moe Jette authored
scontrol options. For now only the NULL plugin is available, but this is required for ASC Purple.
-
- Jun 02, 2004
-
-
Moe Jette authored
option. Use SHOW_ALL as flag.
-
- Apr 30, 2004
-
-
Moe Jette authored
commands modified. Info filtering added to slurmctld.
-
- Mar 16, 2004
-
-
Moe Jette authored
flag the required state saves. Perform the saves using synchronous I/O from just one thread. Under heavy loads this results in much faster responsiveness and lowers slurmctld's memory and CPU overhead considerably.
-
- Mar 11, 2004
-
-
Moe Jette authored
slurmctld startup Create StateSaveLocation directory if changes via slurmctld reconfig
-
- Mar 05, 2004
-
-
Moe Jette authored
-
- Mar 04, 2004
-
-
Moe Jette authored
data structure. This eliminates risks associated with re-reading slurm.conf.
-
- Dec 31, 2003
-
-
Moe Jette authored
modifications were relatively minor - mostly changes in function names or arguments.
-
- Dec 23, 2003
-
-
Moe Jette authored
"Scontrol abort" works. It was leaving a hung pthread due to a recent change. Fix a couple of potential memory leaks "switch_type" has been added to config data structure, un/pack, etc, but not yet reported to the user or documented yet. The plugins now use function calls to get a their type and plugin directory from a common data structure rather than individually reading and parsing the configuration file.
-
- Nov 25, 2003
-
-
Moe Jette authored
Otherwise signal all steps associated with the job (unless individual job steps are identified).
-
- Nov 22, 2003
-
-
Moe Jette authored
Allocate calls were using job table info in place after freeing mutex. New logic copies data structures, frees mutex, sends message, and then frees memory allocated for copy.
-
- Nov 21, 2003
-
-
Moe Jette authored
-
- Nov 20, 2003
-
-
Moe Jette authored
structure.
-
- Nov 14, 2003
-
-
Moe Jette authored
-
- Nov 10, 2003
-
-
Moe Jette authored
-
- Nov 05, 2003
-
-
Moe Jette authored
-
- Oct 24, 2003
-
-
Moe Jette authored
avoid highly fragmented resource allocations. Add list of excluded nodes to job info dumpped and reported. Fix how mis-matched RPC version number are handled. Let error code get back to the API function. Dump job state information upon each job's termination via plugin. Re-issue incomplete write requests in job/partition state save. Make slurmctld continue proper operation without any default partition (gnats:317). Add command/RPC to delete a partition. Retry socket connection for slurmd/io.c as needed (gnats:253).
-
- Oct 11, 2003
-
-
Mark Grondona authored
- changed defs of HAVE_LIBELAN3 to HAVE_ELAN
-
- Oct 08, 2003
-
-
Moe Jette authored
node registration message to itself.
-
- Sep 29, 2003