diff --git a/doc/man/man5/slurm.conf.5 b/doc/man/man5/slurm.conf.5 index 1903db160877322fb5af81eacf47959580bf9d1e..76e612caf34c1b7edea9aee76e6335432cac0324 100644 --- a/doc/man/man5/slurm.conf.5 +++ b/doc/man/man5/slurm.conf.5 @@ -203,12 +203,13 @@ execution of the command "sudo modprobe msr". .TP \fBAcctGatherInfinibandType\fR Identifies the plugin to be used for infiniband network traffic accounting. -The plugin is activated only when profiling on hdf5 files is activated and -the user asks for network data collection for jobs through \-\-profile=Network -(or =All). The collection of network traffic data takes place on node level, +The jobacct_gather plugin and slurmd daemon call this plugin to collect +network traffic data for jobs and nodes. +The collection of network traffic data takes place on the node level, hence only in case of exclusive job allocation the collected values will -reflect the jobs real traffic. All network traffic data are logged on hdf5 files -per job on each node. No storage on the Slurm database takes place. +reflect the job's real traffic. In case of node sharing between jobs the reported +network traffic per job (through sstat or sacct) will not reflect the real +network traffic by the jobs. Configurable values at present are: .RS @@ -219,17 +220,21 @@ No infiniband network data are collected. \fBacct_gather_infiniband/ofed\fR Infiniband network traffic data are collected from the hardware monitoring counters of Infiniband devices through the OFED library. +In order to account for per job network traffic, add the "ic/ofed" TRES to +\fIAccountingStorageTRES\fR. .RE .TP \fBAcctGatherFilesystemType\fR Identifies the plugin to be used for filesystem traffic accounting. -The plugin is activated only when profiling on hdf5 files is activated and -the user asks for filesystem data collection for jobs through \-\-profile=Lustre -(or =All). The collection of filesystem traffic data takes place on node level, +The jobacct_gather plugin and slurmd daemon call this plugin to collect +filesystem traffic data for jobs and nodes. +The collection of filesystem traffic data takes place on the node level, hence only in case of exclusive job allocation the collected values will -reflect the jobs real traffic. All filesystem traffic data are logged on hdf5 files -per job on each node. No storage on the Slurm database takes place. +reflect the job's real traffic. In case of node sharing between jobs the reported +filesystem traffic per job (through sstat or sacct) will not reflect the real +filesystem traffic by the jobs. + Configurable values at present are: .RS @@ -240,6 +245,8 @@ No filesystem data are collected. \fBacct_gather_filesystem/lustre\fR Lustre filesystem traffic data are collected from the counters found in /proc/fs/lustre/. +In order to account for per job lustre traffic, add the "disk/lustre" TRES to +\fIAccountingStorageTRES\fR. .RE .TP