  1. Dec 06, 2013
    • Improve hwloc support for various processors · ac5d734b
      Jason Bacon authored
      Using CPU: Intel(R) Pentium(R) 4 CPU 2.40GHz (2392.04-MHz 686-class CPU)
        Origin = "GenuineIntel"  Id = 0xf27  Family = f  Model = 2  Stepping = 7
      Features=0xbfebfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE>
      
      It's also using an older version of hwloc (1.3.1), and I have not yet tested it with a newer one. However, since 0 and -1 are legitimate return values for hwloc_get_nbobjs_by_type(), I think they should be handled in any case.
      
      From the hwloc_get_nbobjs_by_type() man page:
      
      static inline int hwloc_get_nbobjs_by_type (hwloc_topology_t topology,
             hwloc_obj_type_t type) [static]
             Returns the width of level type type. If no object for that type
             exists, 0 is returned. If there are several levels with objects of that
             type, -1 is returned.
      
      I'm attaching a smarter patch that handles both 0 and -1 return values for both CORE and SOCKET.  It logs a warning if it has to fudge a 0 return code and bails out with a helpful error message for -1, which I have no idea how to handle.  At least people won't have to waste time tracking down the problem this way.
      
      Happy Friday,
      
          Jason
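
The handling described above can be sketched roughly as follows. This is only an illustration, not the attached patch: the helper name `normalize_count`, the `count_status` enum, and the choice of 1 as the fallback for a 0 return are all assumptions. The function takes the value already returned by hwloc_get_nbobjs_by_type(), so the sketch itself has no hwloc dependency.

```c
#include <stdio.h>

/* Hypothetical status codes for this sketch; not part of hwloc or Slurm. */
enum count_status {
    COUNT_OK = 0,         /* hwloc reported a usable positive count */
    COUNT_FUDGED = 1,     /* hwloc returned 0; a fallback value was used */
    COUNT_AMBIGUOUS = -1  /* hwloc returned -1; caller should bail out */
};

/*
 * Interpret the return value of hwloc_get_nbobjs_by_type().
 * nbobjs: value returned by hwloc for one object type
 * label:  human-readable type name used in messages, e.g. "core"
 * count:  set to the count to use; left untouched on COUNT_AMBIGUOUS
 */
static enum count_status normalize_count(int nbobjs, const char *label,
                                         int *count)
{
    if (nbobjs > 0) {
        *count = nbobjs;
        return COUNT_OK;
    }
    if (nbobjs == 0) {
        /* Assumption: fall back to 1 when no object of this type exists. */
        fprintf(stderr, "warning: hwloc found no %s objects, assuming 1\n",
                label);
        *count = 1;
        return COUNT_FUDGED;
    }
    /* -1: several levels contain objects of this type. */
    fprintf(stderr,
            "error: hwloc found several levels with %s objects (returned %d)\n",
            label, nbobjs);
    return COUNT_AMBIGUOUS;
}
```

A caller would presumably pass the results of hwloc_get_nbobjs_by_type(topology, HWLOC_OBJ_CORE) and hwloc_get_nbobjs_by_type(topology, HWLOC_OBJ_SOCKET) through this helper in turn and stop on COUNT_AMBIGUOUS.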
    • Added ApbasilTimeout parameter to the cray.conf · 270f696e
      Trofinoff Stephen authored
      This adds a mechanism to kill a hung apbasil command
    • Fix bad print · 1712d619
      Morris Jette authored
      error introduced in commit ec4df3bf
    • Fix for hwloc returning zero core count · ec4df3bf
      Jason Bacon authored
    • Fix for gres count change · 4e56260f
      Morris Jette authored
      An abort has been reported if the node's gres count differs from
      its bitmap. This has been induced by changing the count manually
      (e.g. "scontrol update nodename=tux123 gres=gpu:4"). I have not
      been able to reproduce this problem, but this will resize the
      bitmap in order to avoid the assert failure.
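
As a rough illustration of the idea (not the actual change in this commit), the sketch below resizes a bitmap whenever its size no longer matches the configured count, preserving bits that are still in range. The `toy_bitmap` type and all `toy_*` functions are hypothetical stand-ins; Slurm uses its own bitstring implementation.

```c
#include <stdlib.h>
#include <string.h>

/* Toy bitmap for illustration only; not Slurm's bitstring type. */
struct toy_bitmap {
    size_t nbits;          /* number of valid bits */
    unsigned char *bytes;  /* storage; bits past nbits are kept at zero */
};

static void toy_set(struct toy_bitmap *bm, size_t i)
{
    bm->bytes[i / 8] |= (unsigned char)(1u << (i % 8));
}

static int toy_test(const struct toy_bitmap *bm, size_t i)
{
    return (bm->bytes[i / 8] >> (i % 8)) & 1;
}

/* Grow or shrink the bitmap to nbits. Returns 0 on success, -1 on failure. */
static int toy_resize(struct toy_bitmap *bm, size_t nbits)
{
    size_t old_bytes = (bm->nbits + 7) / 8;
    size_t new_bytes = (nbits + 7) / 8;
    unsigned char *p = realloc(bm->bytes, new_bytes ? new_bytes : 1);

    if (!p)
        return -1;
    /* Newly added bytes start out cleared. */
    if (new_bytes > old_bytes)
        memset(p + old_bytes, 0, new_bytes - old_bytes);
    /* Clear any bits past the new end in the last byte. */
    if (nbits % 8)
        p[new_bytes - 1] &= (unsigned char)((1u << (nbits % 8)) - 1);
    bm->bytes = p;
    bm->nbits = nbits;
    return 0;
}

/* Resize only when the bitmap size and the count disagree. */
static int toy_sync_to_count(struct toy_bitmap *bm, size_t count)
{
    if (bm->nbits == count)
        return 0;
    return toy_resize(bm, count);
}
```

Resizing on mismatch, rather than asserting that the sizes agree, is the design choice the commit message describes: a manual count change then leads to a corrected bitmap instead of an abort.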
  2. Dec 05, 2013
  3. Dec 04, 2013
  4. Dec 03, 2013
  5. Dec 02, 2013
  6. Nov 29, 2013
  7. Nov 27, 2013
  8. Nov 26, 2013
  9. Nov 25, 2013
  10. Nov 24, 2013