Commit 8bff37ce authored by Natalie Breidenbach
Update big_data_frameworks.md

parent fdaf7e2a
2 merge requests: !938 Automated merge from preview to main, !936 Update to Five-Cluster-Operation
@@ -37,16 +37,16 @@ as via [Jupyter notebooks](#jupyter-notebook). All three ways are outlined in th
 ### Default Configuration
-The Spark and Flink modules are available in both `scs5` and `ml` environments.
-Thus, Spark and Flink can be executed using different CPU architectures, e.g., Haswell and Power9.
+The Spark and Flink modules are available in the `power` environment.
+Thus, Spark and Flink can be executed using different CPU architectures, e.g., Power.
 Let us assume that two nodes should be used for the computation. Use a `srun` command similar to
-the following to start an interactive session using the partition `haswell`. The following code
-snippet shows a job submission to haswell nodes with an allocation of two nodes with 60000 MB main
+the following to start an interactive session. The following code
+snippet shows a job submission with an allocation of two nodes with 60000 MB main
 memory exclusively for one hour:
 ```console
-marie@login$ srun --partition=haswell --nodes=2 --mem=60000M --exclusive --time=01:00:00 --pty bash -l
+marie@login.power$ srun --nodes=2 --mem=60000M --exclusive --time=01:00:00 --pty bash -l
 ```
 Once you have the shell, load desired Big Data framework using the command
@@ -117,11 +117,11 @@ can start with a copy of the default configuration ahead of your interactive ses
 === "Spark"
 ```console
-marie@login$ cp -r $SPARK_HOME/conf my-config-template
+marie@login.power$ cp -r $SPARK_HOME/conf my-config-template
 ```
 === "Flink"
 ```console
-marie@login$ cp -r $FLINK_ROOT_DIR/conf my-config-template
+marie@login.power$ cp -r $FLINK_ROOT_DIR/conf my-config-template
 ```
 After you have changed `my-config-template`, you can use your new template in an interactive job
@@ -175,7 +175,6 @@ example below:
 ```bash
 #!/bin/bash -l
 #SBATCH --time=01:00:00
-#SBATCH --partition=haswell
 #SBATCH --nodes=2
 #SBATCH --exclusive
 #SBATCH --mem=60000M
@@ -205,7 +204,6 @@ example below:
 ```bash
 #!/bin/bash -l
 #SBATCH --time=01:00:00
-#SBATCH --partition=haswell
 #SBATCH --nodes=2
 #SBATCH --exclusive
 #SBATCH --mem=60000M