Skip to content
Snippets Groups Projects
Commit 9e8caddc authored by Martin Schroschk's avatar Martin Schroschk
Browse files

Review: remove target blank

parent fdf0c38d
No related branches found
No related tags found
4 merge requests!333Draft: update NGC containers,!322Merge preview into main,!319Merge preview into main,!258Data Analytics restructuring
# Python for Data Analytics # Python for Data Analytics
Python is a high-level interpreted language widely used in research and science. Using ZIH system Python is a high-level interpreted language widely used in research and science. Using ZIH system
allows you to work with python quicker and more effective. Here, a general introduction to working allows you to work with Python quicker and more effective. Here, a general introduction to working
with python on ZIH systems is given. Further documentation is available for specific with Python on ZIH systems is given. Further documentation is available for specific
[machine learning frameworks](machine_learning.md). [machine learning frameworks](machine_learning.md).
## Python Console and Virtual Environments ## Python Console and Virtual Environments
Often, it is useful to create an isolated development environment, which can be shared among Often, it is useful to create an isolated development environment, which can be shared among
a research group and/or teaching class. For this purpose, [python virtual environments](python_virtual_environments.md) a research group and/or teaching class. For this purpose,
can be used. [Python virtual environments](python_virtual_environments.md) can be used.
The interactive Python interpreter can also be used on ZIH systems via an interactive job: The interactive Python interpreter can also be used on ZIH systems via an interactive job:
```console ```console
marie@login$ srun --partition=haswell --gres=gpu:1 --ntasks=1 --cpus-per-task=7 --pty --mem-per-cpu=8000 bash marie@login$ srun --partition=haswell --gres=gpu:1 --ntasks=1 --cpus-per-task=7 --pty --mem-per-cpu=8000 bash
marie@compute$ module load Python marie@haswell$ module load Python
marie@compute$ python marie@haswell$ python
Python 3.8.6 (default, Feb 17 2021, 11:48:51) Python 3.8.6 (default, Feb 17 2021, 11:48:51)
[GCC 10.2.0] on linux [GCC 10.2.0] on linux
Type "help", "copyright", "credits" or "license" for more information. Type "help", "copyright", "credits" or "license" for more information.
...@@ -40,10 +40,10 @@ Jupyter notebook on a node, using a GPU when needed. ...@@ -40,10 +40,10 @@ Jupyter notebook on a node, using a GPU when needed.
[Pandas](https://pandas.pydata.org/){:target="_blank"} is a widely used library for data [Pandas](https://pandas.pydata.org/){:target="_blank"} is a widely used library for data
analytics in Python. analytics in Python.
In many cases, an existing source code using Pandas can be easily modified for parallel execution In many cases, an existing source code using Pandas can be easily modified for parallel execution by
by using the [pandarallel](https://github.com/nalepae/pandarallel/tree/v1.5.2){:target="_blank"} using the [pandarallel](https://github.com/nalepae/pandarallel/tree/v1.5.2) module. The number of
module. The number of threads that can be used in parallel depends on the number of cores threads that can be used in parallel depends on the number of cores (parameter `--cpus-per-task`)
(parameter `--cpus-per-task`) within the Slurm request, e.g. within the Slurm request, e.g.
```console ```console
marie@login$ srun --partition=haswell --cpus-per-task=4 --mem=2G --hint=nomultithread --pty --time=8:00:00 bash marie@login$ srun --partition=haswell --cpus-per-task=4 --mem=2G --hint=nomultithread --pty --time=8:00:00 bash
...@@ -56,6 +56,7 @@ pandarallel module. If the pandarallel module is not installed already, use a ...@@ -56,6 +56,7 @@ pandarallel module. If the pandarallel module is not installed already, use a
[virtual environment](python_virtual_environments.md) to install the module. [virtual environment](python_virtual_environments.md) to install the module.
??? example ??? example
```python ```python
import pandas as pd import pandas as pd
import numpy as np import numpy as np
...@@ -80,7 +81,7 @@ pandarallel module. If the pandarallel module is not installed already, use a ...@@ -80,7 +81,7 @@ pandarallel module. If the pandarallel module is not installed already, use a
df.parallel_apply(func=transform, axis=1) df.parallel_apply(func=transform, axis=1)
``` ```
For more examples of using pandarallel check out For more examples of using pandarallel check out
[https://github.com/nalepae/pandarallel/blob/master/docs/examples.ipynb](https://github.com/nalepae/pandarallel/blob/master/docs/examples.ipynb){:target="_blank"}. [https://github.com/nalepae/pandarallel/blob/master/docs/examples.ipynb](https://github.com/nalepae/pandarallel/blob/master/docs/examples.ipynb).
### Dask ### Dask
......
  • Christoph Lehmann @clehm--tu-dresden.de ·
    Developer

    {:target="_blank"} habe ich immer dann bewusst verwendet, wenn einen externe Seite aufgerufen wird - damit wird kann man in der Doku blättern und andere Seiten stören dann nicht.

  • Martin Schroschk @mflehmig--tu-dresden.de ·
    Author Owner

    Den Sinn habe ich verstanden. Das Verhalten, wenn man Links klickt, soll in der Hand der Nutzenden liegen.

0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment