From e43e81a9b81fb2d4dd62c0754a0844429df61931 Mon Sep 17 00:00:00 2001 From: Moe Jette <jette1@llnl.gov> Date: Mon, 23 May 2005 18:25:46 +0000 Subject: [PATCH] Added ibm.html and revise bluegene.html doc --- NEWS | 1 + doc/Makefile.am | 1 + doc/html/bluegene.html | 14 ++-- doc/html/documentation.html | 6 +- doc/html/ibm.html | 128 ++++++++++++++++++++++++++++++++++++ 5 files changed, 144 insertions(+), 6 deletions(-) create mode 100644 doc/html/ibm.html diff --git a/NEWS b/NEWS index c54460aa5de..53775cce2a9 100644 --- a/NEWS +++ b/NEWS @@ -5,6 +5,7 @@ documents those changes that are of interest to users and admins. ============================= -- Load bluegene.conf and federation.conf based upon SLURM_CONF env var (if set). + -- Add doc/html/ibm.html document. Update bluegene.html. * Changes in SLURM 0.5.0-pre5 ============================= diff --git a/doc/Makefile.am b/doc/Makefile.am index 1f9f6af358c..3bad8ed0268 100644 --- a/doc/Makefile.am +++ b/doc/Makefile.am @@ -14,6 +14,7 @@ html_DATA = \ html/entities.gif \ html/faq.html \ html/help.html \ + html/ibm.html \ html/jobcompplugins.html \ html/lll.gif \ html/mail.html \ diff --git a/doc/html/bluegene.html b/doc/html/bluegene.html index 1d41a85e010..d43b2348166 100644 --- a/doc/html/bluegene.html +++ b/doc/html/bluegene.html @@ -9,7 +9,7 @@ <meta http-equiv="keywords" content="Simple Linux Utility for Resource Management, SLURM, resource management, Linux clusters, high-performance computing, Livermore Computing"> <meta name="LLNLRandR" content="UCRL-WEB-209488"> -<meta name="LLNLRandRdate" content="19 May 2005"> +<meta name="LLNLRandRdate" content=" May 2005"> <meta name="distribution" content="global"> <meta name="description" content="Simple Linux Utility for Resource Management"> <meta name="copyright" @@ -176,13 +176,19 @@ In particular, the configure script searches for <i>libdb2.so</i> in the directories <i>/home/bgdb2cli/sqllib</i> and <i>/u/bgdb2cli/sqllib</i>. If your DB2 library file is in a different location, use the configure option <i>--with-db2-dir=PATH</i> to specify the parent directory. -You will need to configure and build two sets of files for installation. +If you have the samem version of the operating system on both the +Service Node (SN) and the Front End Nodes (FEN) then you can configure +and build one set of files on the SN and install them on both the SN and FEN. +If you have different versions of the operating system on the SN and FEN +(as was the case for some early system installations), then you will need +to coconfigure and build two sets of files for installation. One set will be for the Service Node (SN), which has direct access to the BGL Bridge APIs. The second set will be for the Front End Nodes (FEN), whick lack access to the Bridge APIs and interact with using Remote Proceedure Calls to the slurmctld daemon. You should see "#define HAVE_BGL 1" and "#define HAVE_FRONT_END 1" in the "config.h" file for both the SN and FEN builds. -You should also see "#define HAVE_BGL_FILES 1" in config.h on the SN before making SLURM.</p> +You should also see "#define HAVE_BGL_FILES 1" in config.h on the SN before +building SLURM. </p> <p>The slurmctld daemon should execute on the system's service node. If an optional backup daemon is used, it must be in some location where @@ -447,7 +453,7 @@ nodes, so the message is more meaningful there). </p> <td colspan="3"><hr> <p>For information about this page, contact <a href="mailto:slurm-dev@lists.llnl.gov">slurm-dev@lists.llnl.gov</a>.</p> <p><a href="http://www.llnl.gov/"><img align=middle src="lll.gif" width="32" height="32" border="0"></a></p> <p class="footer">UCRL-WEB-209488<br> -Last modified 19 May 2005</p></td> +Last modified 23 May 2005</p></td> </tr> </table> </td> diff --git a/doc/html/documentation.html b/doc/html/documentation.html index 536c5db4ac0..4ecfe59bafa 100644 --- a/doc/html/documentation.html +++ b/doc/html/documentation.html @@ -9,7 +9,7 @@ <meta http-equiv="keywords" content="Simple Linux Utility for Resource Management, SLURM, resource management, Linux clusters, high-performance computing, Livermore Computing"> <meta name="LLNLRandR" content="UCRL-WEB-209488"> -<meta name="LLNLRandRdate" content="8 October 2004"> +<meta name="LLNLRandRdate" content="23 May 2004"> <meta name="distribution" content="global"> <meta name="description" content="Simple Linux Utility for Resource Management"> <meta name="copyright" @@ -58,11 +58,13 @@ structure:Laboratories and Other Field Facilities"> <li><a href="quickstart.html">Quick Start User Guide</a></li> <li><a href="http://www.llnl.gov/LCdocs/slurm">SLURM Reference Manual</a></li> <li><a href="bluegene.html">Blue Gene User and Administrator Guide</a></li> +<li><a href="ibm.html">IBM AIX User and Administrator Guide</a></li> </ul> <h3>SLURM Administrators</h3> <ul> <li><a href="quickstart_admin.html">Quick Start Administrator Guide</a></li> <li><a href="bluegene.html">Blue Gene User and Administrator Guide</a></li> +<li><a href="ibm.html">IBM AIX User and Administrator Guide</a></li> </ul> <h3>SLURM Developers</h3> <ul> @@ -80,7 +82,7 @@ structure:Laboratories and Other Field Facilities"> <td colspan="3"><hr> <p>For information about this page, contact <a href="mailto:slurm-dev@lists.llnl.gov">slurm-dev@lists.llnl.gov</a>.</p> <p><a href="http://www.llnl.gov/"><img align=middle src="lll.gif" width="32" height="32" border="0"></a></p> <p class="footer">UCRL-WEB-209488<br> -Last modified 8 October 2004</p></td> +Last modified 23 May 2005</p></td> </tr> </table> </td> diff --git a/doc/html/ibm.html b/doc/html/ibm.html new file mode 100644 index 00000000000..222fba2ed1d --- /dev/null +++ b/doc/html/ibm.html @@ -0,0 +1,128 @@ +<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" + "http://www.w3.org/TR/REC-html40/loose.dtd"> + +<html> + +<head> +<meta http-equiv="Content-Type" content="text/html; charset=iso-8859-1"> +<meta http-equiv="Pragma" content="no-cache"> +<meta http-equiv="keywords" content="Simple Linux Utility for Resource Management, SLURM, resource management, +Linux clusters, high-performance computing, Livermore Computing"> +<meta name="LLNLRandR" content="UCRL-WEB-209488"> +<meta name="LLNLRandRdate" content=" May 2005"> +<meta name="distribution" content="global"> +<meta name="description" content="Simple Linux Utility for Resource Management"> +<meta name="copyright" +content="This document is copyrighted U.S. +Department of Energy under Contract W-7405-Eng-48"> +<meta name="Author" content="Morris Jette"> +<meta name="email" content="jette1@llnl.gov"> +<meta name="Classification" +content="DOE:DOE Web sites via organizational +structure:Laboratories and Other Field Facilities"> +<title>Simple Linux Utility for Resource Management:IBM AIX User and Administrator Guide</title> +<link href="slurmstyles.css" rel="stylesheet" type="text/css"> +</head> + +<body bgcolor="#000000" text="#000000" leftmargin="0" topmargin="0"> +<table width="770" border="0" cellspacing="0" cellpadding="0"> +<tr> +<td><img src="slurm_banner.jpg" width="770" height="145" usemap="#Map" border="0" alt="Simple Linux Utility for Resource Management"></td> +</tr> +</table> +<table width="770" border="0" cellspacing="0" cellpadding="3" bgcolor="#FFFFFF"> +<tr> +<td width="100%"> +<table width="760" border="0" cellspacing="0" cellpadding="4" align="right"> +<tr> +<td valign="top" bgcolor="#000000"><p><img src="spacer.gif" width="110" height="1" alt=""></p> +<p><a href="slurm.html" class="nav" align="center">Home</a></p> +<p><span class="whitetext">About</span><br> +<a href="overview.html" class="nav">Overview</a><br> +<a href="news.html" class="nav">What's New</a><br> +<a href="publications.html" class="nav">Publications</a><br> +<a href="team.html" class="nav">SLURM Team</a></p> +<p><span class="whitetext">Using</span><br> +<a href="documentation.html" class="nav">Documentation</a><br> +<a href="faq.html" class="nav">FAQ</a><br> +<a href="help.html" class="nav">Getting Help</a><br> +<a href="mail.html" class="nav">Mailing Lists</a></p> +<p><span class="whitetext">Installing</span><br> +<a href="platforms.html" class="nav">Platforms</a><br> +<a href="download.html" class="nav">Download</a><br> +<a href="quickstart_admin.html" class="nav">Guide</a></p></td> +<td><img src="spacer.gif" width="10" height="1" alt=""></td> +<td valign="top"><h2>IBM AIX User and Administrator Guide</h2> + +<h3>Overview</h3> + +<p>This document describes the unique features of SLURM on the +IBM AIX computers with a Federation switch. +You should be familiar with the SLURM's mode of operation on Linux clusters +before studying the relatively few differences in IBM system operation +described in this document.</p> + +<h3>User Tools</h3> + +<p>The normal set of SLURM user tools: srun, scancel, sinfo, smap, squeue and scontrol +provide all of the expected services except support for job steps. +While the srun command will launch the tasks of a job step on an IBM +AIX system, it does not support use of the Federation switch or IBM's MPI. +Job steps should be launched using IBM's poe command. +This architecture insures proper operation of all IBM tools.</p> + +<p>You will use srun to submit a batch script to SLURM. +This script should contain one or more invocations of poe to launch +the tasks. +If you want to run a job interactively, just execute poe directly. +Poe will recognize that it lacks a SLURM job allocation (the SLURM_JOBID +environment variable will be missing) and create the SLURM allocation +prior to launching tasks.</p> + +<h3>System Administration</h3> + +<p>Three unique components are required to use SLURM on an IBM system. +<ol> +<li>The Federation switch plugin is required. +This component is packaged with the SLURM distrbution.</li> +<li>There is a process tracking kernel extension required. +This is used to insure that all processes associated with a job +are tracked. +SLURM normatlly uses session ID and process group ID on Linux systems, +but these mechanisms can not prevent user processes from establishing +their own session or process group and thus "escape" from SLURM +tracking. +This kernel extension is not packaged with SLURM, but is available +upon request.</li> +<li>The final component is a library that accepts poe library calls +and performs actions in SLURM to satisfy these requests, such +as launching tasks. +This library is based upon IBM Confidential information and is +not at this time available for distribution. +Interested parties are welcome to pursue the possible distribution +of this library with IBM and SLURM developers.</li> +</ol> +Until this last issue is resolved, use of SLURM on an IBM AIX system +should not be viewed as a supported configuration (at least outside +of LLNL, which established a contract with IBM for this purpose).</p> + +<p class="footer"><a href="#top">top</a></p></td> + +</tr> +<tr> +<td colspan="3"><hr> <p>For information about this page, contact <a href="mailto:slurm-dev@lists.llnl.gov">slurm-dev@lists.llnl.gov</a>.</p> +<p><a href="http://www.llnl.gov/"><img align=middle src="lll.gif" width="32" height="32" border="0"></a></p> +<p class="footer">UCRL-WEB-209488<br> +Last modified 23 May 2005</p></td> +</tr> +</table> +</td> + </tr> +</table> +<map name="Map"> +<area shape="rect" coords="616,4,762,97" href="../"> +<area shape="rect" coords="330,1,468,11" href="http://www.llnl.gov/disclaimer.html"> +<area shape="rect" coords="11,23,213,115" href="slurm.html"> +</map> +</body> +</html> -- GitLab