Introduction to Hybrid Programming in HPC



(Room changed to Rühle Saal, Nobelstr. 19, 70569 Stuttgart, modified on April 4, 2017)

Most HPC systems are clusters of shared memory nodes. Such SMP nodes range from small multi-core CPUs to large many-core CPUs. Parallel programming may combine distributed memory parallelization on the node interconnect (e.g., with MPI) with shared memory parallelization inside each node (e.g., with OpenMP or MPI-3.0 shared memory). This course analyzes the strengths and weaknesses of several parallel programming models on clusters of SMP nodes. Multi-socket, multi-core systems in highly parallel environments are given special consideration. MPI-3.0 introduced a new shared memory programming interface, which can be combined with inter-node MPI communication. It can be used for direct neighbor accesses similar to OpenMP or for direct halo copies, and it enables new hybrid programming models. These models are compared with various hybrid MPI+OpenMP approaches and with pure MPI. Numerous case studies and micro-benchmarks demonstrate the performance-related aspects of hybrid programming.

Tools for hybrid programming, such as thread/process placement support and performance analysis, are presented in a "how-to" section. This course provides scientific training in Computational Science and, in addition, fosters scientific exchange among the participants.

Agenda & Content

09:30 Registration
10:00 Welcome
10:05 Motivation
10:15 Introduction
10:45 Programming Models
       - Pure MPI
11:15 Coffee break
11:35  - MPI + MPI-3.0 Shared Memory
13:10 Lunch
14:00  - MPI + OpenMP
15:10 Coffee break
15:30  - MPI + OpenMP (continued)
16:00  - MPI + Accelerators
16:15 Tools
16:25 Conclusions
16:45 Q&A
17:00 End


Prerequisites: basic MPI and OpenMP knowledge, as presented in several MPI and OpenMP parallelization courses, e.g., in the HLRS course program.


The course language is English.


Dr. habil. Georg Hager (RRZE/HPC, Uni. Erlangen), Dr. Rolf Rabenseifner


Each participant will get a paper copy of all slides.


Registration is via the online registration form.

Extended Deadline with Late Registration

The deadline for registration is June 6, 2017 (extended deadline).
Late registrations after the deadline are still possible, but the quality of the handouts may be reduced.


Students without Diploma/Master: 20 EUR
Students with Diploma/Master (PhD students) at German universities: 30 EUR
Members of German universities and public research institutes: 30 EUR
Members of universities and public research institutes within EU or PRACE member countries: 60 EUR
Members of other universities and public research institutes: 120 EUR
Others: 240 EUR
(includes coffee breaks)


Travel Information and Accommodation

See our "How to find us" page.


HLRS is part of the Gauss Centre for Supercomputing (GCS), which is one of the six PRACE Advanced Training Centres (PATCs) that started in Feb. 2012.
HLRS is also a member of the Baden-Württemberg initiative bwHPC-C5.
This course is provided within the framework of the bwHPC-C5 user support. This course is not part of the PATC curriculum and is not sponsored by the PATC program.

Local Organizer

Rolf Rabenseifner, phone 0711 685 65530
Lucienne Dettki, phone 0711 685 63894

Shortcut-URL & Course Number