- Dec 10, 2013
-
-
Jason Sollom authored
-
Danny Auble authored
-
Jason Sollom authored
-
Danny Auble authored
-
David Bigagli authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
-
Morris Jette authored
Conflicts: src/squeue/print.c src/sview/job_info.c
-
- Dec 09, 2013
-
-
Morris Jette authored
This is needed for job arrays with discontiguous task ID values (e.g. "123_[1,3,5,...99999]")
-
Morris Jette authored
Previously job arrays were only listed with their native job ID (e.g. 123_0 listed as 123, 123_1 as 124, etc). Now lists the job ID using both format (e.g. "123_1 (124)"). The same format is used for job step IDs (e.g. "123_1.2 (124.2)").
-
- Dec 08, 2013
-
-
jette authored
-
jette authored
If the GRES is associated with specific files AND the GRES count is reset using scontrol AND the slurmd is restarted either without a gres.conf file or with a count and no specific files AND the GRES count is then increased using scontrol the GRES bitmap will not match its count This fixes the root cause of the mismatch between bitmap size and GRES count and should render the rebuilding of the bitmap unnecessary. The rebuilding was handled in the following commits commit ec4df3bf commit 1712d619
-
- Dec 07, 2013
-
-
David Bigagli authored
-
Morris Jette authored
Correction to commit 5a4b9e0c
-
David Bigagli authored
conflict resolution.
-
Danny Auble authored
-
Danny Auble authored
-
Philip D. Eckert authored
-
David Bigagli authored
This reverts commit 58c12f7e.
-
David Bigagli authored
the slurmctld throws a fatal error.
-
- Dec 06, 2013
-
-
Jason Bacon authored
Using CPU: Intel(R) Pentium(R) 4 CPU 2.40GHz (2392.04-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0xf27 Family = f Model = 2 Stepping = 7 Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE> It's also using an older version of hwloc (1.3.1) and I have not yet tested it with a newer one, but since 0 and -1 are legitimate returns values for hwloc_get_nbobjs_by_type(), I think they should be handled in any case. From the hwloc_get_nbobjs_by_type() man page: static inline int hwloc_get_nbobjs_by_type (hwloc_topology_ttopology, hwloc_obj_type_ttype) [static] Returns the width of level type type. If no object for that type exists, 0 is returned. If there are several levels with objects of that type, -1 is returned. I'm attaching a smarter patch that handles both 0 and -1 return values for both CORE and SOCKET. It logs a warning if it has to fudge a 0 return code and bails out with a helpful error message for -1, which I have no idea how to handle. At least people won't have to waste time tracking down the problem this way. Happy Friday, Jason
-
Trofinoff Stephen authored
This adds a mechanism to kill a hung apbasil command
-
Morris Jette authored
-
Morris Jette authored
error introduced in commit ec4df3bf
-
Morris Jette authored
-
Jason Bacon authored
-
Danny Auble authored
Fix for python 3 encoding
-
Morris Jette authored
A abort has been reported if the node's gres count differs from it's bitmap. This has been induced by changing the count manually (e.g. scontrol update nodename=tux123 gres=gpu:4"). I have not been able to reproduce this problem, but this will resize the bitmap in order to avoid the assert failure.
-
Danny Auble authored
-
- Dec 05, 2013
-
-
Danny Auble authored
-
Danny Auble authored
news.html.
-
Teun Docter authored
-
Danny Auble authored
-
Taras Shapovalov authored
instead of when running on the node for the first time.
-
Danny Auble authored
-
Danny Auble authored
-
Danny Auble authored
not global macros.
-
Danny Auble authored
-