Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
hpc-compendium
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Deploy
Releases
Package Registry
Container Registry
Model registry
Operate
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
ZIH
hpcsupport
hpc-compendium
Commits
0576a75a
Commit
0576a75a
authored
5 months ago
by
Ulf Markwardt
Browse files
Options
Downloads
Patches
Plain Diff
Update overview.md
parent
69537b8e
No related branches found
Branches containing commit
No related tags found
2 merge requests
!1138
Automated merge from preview to main
,
!1134
Review documentation w.r.t. filesystems and hardware
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
doc.zih.tu-dresden.de/docs/jobs_and_resources/overview.md
+6
-5
6 additions, 5 deletions
doc.zih.tu-dresden.de/docs/jobs_and_resources/overview.md
with
6 additions
and
5 deletions
doc.zih.tu-dresden.de/docs/jobs_and_resources/overview.md
+
6
−
5
View file @
0576a75a
# Introduction HPC Resources and Jobs
# Introduction HPC Resources and Jobs
ZIH operates high performance computing (HPC) systems with
more than
100.000 cores,
10
00 GPUs, and a
ZIH operates high performance computing (HPC) systems with
about
100.000 cores,
9
00 GPUs, and a
flexible storage hierarchy with about 40 PB total capacity. The HPC system provides an optimal
flexible storage hierarchy with about 40 PB total capacity. The HPC system provides an optimal
research environment especially in the area of data analytics, artificial intelligence methods and
research environment especially in the area of data analytics, artificial intelligence methods and
machine learning as well as for processing extremely large data sets. Moreover it is also a perfect
machine learning as well as for processing extremely large data sets. Moreover it is also a perfect
...
@@ -8,10 +8,10 @@ platform for highly scalable, data-intensive and compute-intensive applications
...
@@ -8,10 +8,10 @@ platform for highly scalable, data-intensive and compute-intensive applications
capabilities for energy measurement and performance monitoring. Therefore provides ideal conditions
capabilities for energy measurement and performance monitoring. Therefore provides ideal conditions
to achieve the ambitious research goals of the users and the ZIH.
to achieve the ambitious research goals of the users and the ZIH.
The HPC system
, redesigned in December 2023,
consists of five
homogeneous
clusters with their own
The HPC system consists of five clusters with their own
[
Slurm
](
slurm.md
)
instances and cluster specific
[
Slurm
](
slurm.md
)
instances and cluster specific
login nodes. The clusters share
one
login nodes. The clusters share
a number of different
[
filesystem
](
../data_lifecycle/file_systems.md
)
which enable
s
users to
easily
switch between the
[
filesystem
s
](
../data_lifecycle/file_systems.md
)
which enable users to switch between the
components.
components.
## Selection of Suitable Hardware
## Selection of Suitable Hardware
...
@@ -53,7 +53,8 @@ The following questions may help to decide which cluster to use
...
@@ -53,7 +53,8 @@ The following questions may help to decide which cluster to use
<!-- cluster_overview_table -->
<!-- cluster_overview_table -->
|Name|Description| DNS | Nodes | # Nodes | Cores per Node | Threads per Core | Memory per Node [in MB] | Memory per Core [in MB] | GPUs per Node
|Name|Description| DNS | Nodes | # Nodes | Cores per Node | Threads per Core | Memory per Node [in MB] | Memory per Core [in MB] | GPUs per Node
|---|---|----|:---|---:|---:|---:|---:|---:|---:|
|---|---|----|:---|---:|---:|---:|---:|---:|---:|
|
**Barnard**
<br>
_2023_
| CPU|
`n[node].barnard.hpc.tu-dresden.de`
|n[1001-1630] | 630 |104| 2 |515,000 |2,475 | 0 |
|
**Capella**
<br>
_2024_
| GPU|
`c[node].barnard.hpc.tu-dresden.de`
|c[1-144] | 144 |64| 1 |768,000 | | 0 |
|
**Barnard**
<br>
_2023_
| CPU|
`n[node].barnard.hpc.tu-dresden.de`
|n[1001-1630] | 630 |104| 2 |515,000 |12,000 | 4 |
|
**Alpha**
<br>
_2021_
| GPU |
`i[node].alpha.hpc.tu-dresden.de`
|taurusi[8001-8034] | 34 | 48 | 2 | 990,000 | 10,312| 8 |
|
**Alpha**
<br>
_2021_
| GPU |
`i[node].alpha.hpc.tu-dresden.de`
|taurusi[8001-8034] | 34 | 48 | 2 | 990,000 | 10,312| 8 |
|
**Romeo**
<br>
_2020_
| CPU |
`i[node].romeo.hpc.tu-dresden.de`
|taurusi[7001-7192] | 192|128 | 2 | 505,000| 1,972 | 0 |
|
**Romeo**
<br>
_2020_
| CPU |
`i[node].romeo.hpc.tu-dresden.de`
|taurusi[7001-7192] | 192|128 | 2 | 505,000| 1,972 | 0 |
|
**Julia**
<br>
_2021_
| single SMP system |
`julia.hpc.tu-dresden.de`
| julia | 1 | 896 | 1 | 48,390,000 | 54,006 | - |
|
**Julia**
<br>
_2021_
| single SMP system |
`julia.hpc.tu-dresden.de`
| julia | 1 | 896 | 1 | 48,390,000 | 54,006 | - |
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment