 
                                                
                                            
Oct. 28, 2025, 13:00 - Oct. 31, 2025, 17:00
English
Intermediate
Topic areas: Data in HPC / Deep Learning / Machine Learning
Topics: Artificial Intelligence
Big Data
Deep Learning
GPU Programming
Machine Learning
Scientific Machine Learning
For day one, you need basic experience with C/C++ or Fortran. Suggested resources to satisfy prerequisites: the learn-c.org interactive tutorial, https://www.learn-c.org/. Familiarity with MPI is a plus.
On day two, you need an understanding of fundamental programming concepts in Python 3, such as functions, loops, dictionaries, and arrays; familiarity with Pandas data structures; and an understanding of how to compute a regression line.
Suggested resources to satisfy prerequisites: Python Beginner’s Guide. Familiarity with PyTorch will be a plus as it will be used in the hands-on sessions.
Experience with Deep Learning using Python 3 and, in particular, with training models by gradient descent will be needed on days three and four. Experience with PyTorch will also be helpful; see https://pytorch.org/tutorials/ for instance.
Please be aware that, while the second day offers an introduction to or recap of Deep Learning, most of the topics in this course are rather advanced. If you are completely unfamiliar with Deep Learning, the learning curve might be steep on days three and four.
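As a rough self-check for the prerequisites above (computing a regression line, training by gradient descent), the following minimal sketch fits a line to a few points in two ways. It is not taken from the course material; the sample data, the function names, the learning rate, and the step count are all assumptions chosen for illustration, and it uses plain Python rather than PyTorch or Pandas.

```python
# Illustrative sketch only; data and hyperparameters are made up.

def closed_form_line(xs, ys):
    """Least-squares slope and intercept computed directly from the data."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def gradient_descent_line(xs, ys, lr=0.05, steps=5000):
    """Fit y = w * x + b by minimizing the mean squared error with gradient descent."""
    n = len(xs)
    w, b = 0.0, 0.0
    for _ in range(steps):
        # Gradients of (1/n) * sum((w*x + b - y)^2) with respect to w and b.
        grad_w = (2.0 / n) * sum((w * x + b - y) * x for x, y in zip(xs, ys))
        grad_b = (2.0 / n) * sum((w * x + b - y) for x, y in zip(xs, ys))
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.1, 2.9, 5.2, 7.1, 8.8]  # hypothetical measurements

slope, intercept = closed_form_line(xs, ys)
w, b = gradient_descent_line(xs, ys)
print(f"closed form:      slope={slope:.4f}, intercept={intercept:.4f}")
print(f"gradient descent: slope={w:.4f}, intercept={b:.4f}")
```

If both printed lines agree and each step of the loop is easy to follow, the gradient descent prerequisite is probably covered.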
Basic: 4 hours
Intermediate: 4.5 hours
Advanced: 15 hours
Lecturers and assistant trainers: Ajay Navilarekal (LRZ and NVIDIA University Ambassador), Tobias Haas (HLRS), Lorenzo Zanon (HLRS), Junghwa Lee (HLRS), Layal Ali (HLRS).
1st day: Introduction to multi-GPU programming
2nd day: Introduction to Deep Learning
3rd day: Data Parallelism: How to Train Deep Learning Models on Multiple GPUs
4th day: Model Parallelism: Building Large Neural Networks
1st day (Tue): Introduction to multi-GPU programming (13:00 - 17:00)
On the first day, you will learn the basics of multi-GPU programming. This will give you a rough idea of how Deep Learning can be implemented on multiple GPUs.
2nd day (Wed): Introduction to Deep Learning (9:00 - 17:00)
Explore the fundamentals of deep learning by training neural networks and using results to improve performance and capabilities.
During this day, you’ll learn the basics of deep learning by training and deploying neural networks.
3rd day (Thu): Data Parallelism: How to Train Deep Learning Models on Multiple GPUs (9:00 - 17:00)
The computational requirements of the deep neural networks behind AI applications such as self-driving cars are enormous. A single training cycle can take weeks on a single GPU, or even years for larger datasets like those used in self-driving car research. Using multiple GPUs for deep learning can significantly shorten the time required to train on large amounts of data, making it feasible to solve complex problems with deep learning.
On the third day we will teach you how to use multiple GPUs to train neural networks.
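The idea behind data parallelism can be sketched without any GPU: every worker holds the same model parameters, computes gradients on its own shard of the batch, and the gradients are averaged before each worker applies the same update. The sketch below simulates this in plain Python for a one-parameter-pair linear model. It is an assumption-laden illustration, not the course's method: the function names are hypothetical, `all_reduce_mean` merely stands in for the collective communication a real framework would perform, and the batch size is assumed to be divisible by the number of workers.

```python
# Illustrative simulation of data parallelism; no real devices are involved.

def mse_gradient(w, b, batch):
    """Gradient of the mean squared error of y = w * x + b on a batch of (x, y) pairs."""
    n = len(batch)
    grad_w = (2.0 / n) * sum((w * x + b - y) * x for x, y in batch)
    grad_b = (2.0 / n) * sum((w * x + b - y) for x, y in batch)
    return grad_w, grad_b


def all_reduce_mean(per_worker_grads):
    """Stand-in for an all-reduce: average each gradient component across workers."""
    num_workers = len(per_worker_grads)
    num_components = len(per_worker_grads[0])
    return tuple(sum(g[i] for g in per_worker_grads) / num_workers
                 for i in range(num_components))


def data_parallel_step(w, b, batch, num_workers, lr):
    """One training step with the batch split evenly across simulated workers."""
    shard_size = len(batch) // num_workers  # assumes an even split
    shards = [batch[i * shard_size:(i + 1) * shard_size] for i in range(num_workers)]
    # Every worker starts from the same (w, b) and sees only its own shard.
    local_grads = [mse_gradient(w, b, shard) for shard in shards]
    grad_w, grad_b = all_reduce_mean(local_grads)
    return w - lr * grad_w, b - lr * grad_b


batch = [(float(x), 2.0 * x + 1.0) for x in range(8)]  # made-up data on y = 2x + 1

w_dp, b_dp = data_parallel_step(0.0, 0.0, batch, num_workers=4, lr=0.01)

# Reference: the same step computed on the full batch by a single worker.
gw, gb = mse_gradient(0.0, 0.0, batch)
w_ref, b_ref = 0.0 - 0.01 * gw, 0.0 - 0.01 * gb
print(w_dp, w_ref)
print(b_dp, b_ref)
```

With equally sized shards, the averaged shard gradients match the full-batch gradient up to floating-point rounding, which is why the two printed pairs should agree.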
 
4th day (Fri): Model Parallelism: Building Large Neural Networks (9:00 - 17:00)
The fourth day's agenda covers the fundamentals and advanced techniques of large-scale model training, beginning with an introduction to training large models and progressing through sophisticated distributed training concepts. You will explore key parallelization strategies including tensor and pipeline parallelism, sequence parallelism, and mixture of experts architectures, while also examining critical optimization techniques such as activation checkpointing and offloading, alongside collective communication principles and hardware design considerations.
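To give a feel for one of the strategies named above, the sketch below simulates a column-wise split of a linear layer's weight matrix, which is one common form of tensor parallelism. It is a toy illustration in plain Python under stated assumptions: the function names are hypothetical, the "devices" are just list slices, the concatenation stands in for an all-gather, and the number of weight columns is assumed to be divisible by the number of devices.

```python
# Illustrative simulation of column-wise tensor parallelism; no real devices are involved.

def matmul(a, b):
    """Multiply matrix a (m x k) by matrix b (k x n), both given as lists of rows."""
    return [[sum(a[i][p] * b[p][j] for p in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


def split_columns(matrix, parts):
    """Split a matrix into `parts` column blocks of equal width."""
    width = len(matrix[0]) // parts  # assumes an even split
    return [[row[p * width:(p + 1) * width] for row in matrix] for p in range(parts)]


def column_parallel_linear(x, weight, num_devices):
    """Compute x @ weight with the weight columns sharded across simulated devices."""
    shards = split_columns(weight, num_devices)
    # Each simulated device multiplies the full input by its own column block.
    partial_outputs = [matmul(x, shard) for shard in shards]
    # Stand-in for an all-gather: concatenate the output column blocks row by row.
    return [sum((out[i] for out in partial_outputs), []) for i in range(len(x))]


x = [[1.0, 2.0],
     [3.0, 4.0]]
weight = [[1.0, 0.0, 2.0, 1.0],
          [0.0, 1.0, 1.0, 3.0]]

y_parallel = column_parallel_linear(x, weight, num_devices=2)
y_reference = matmul(x, weight)
print(y_parallel)
print(y_reference)
```

Each simulated device stores only half of the weight matrix, yet the gathered result matches the unsharded product; the price is the communication step at the end.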
Exercises
The exercises will be carried out on cloud instances and on one of HLRS's clusters (on the first day).
Lunch
Starting from the second course day, we will have a lunch break from 12:30 to 13:30.
Besides the content of the training itself, an important aspect of this event is the scientific exchange among the participants. We try to facilitate such communication by
This course is only open to academic participants.
Register via the button at the top of this page.
Registration closed on September 30, 2025.
If you are not interested in all days, please select only the days you are interested in when registering.
Important information: After you are accepted, please create an account at learn.nvidia.com/join (if you don't have an account, just enter your e-mail address and continue).
NVIDIA Deep Learning Institute:
The NVIDIA Deep Learning Institute delivers hands-on training for developers, data scientists, and engineers. The program is designed to help you get started with training, optimizing, and deploying neural networks to solve real-world problems across diverse industries such as self-driving cars, healthcare, online services, and robotics.
This course is open to academic participants only.
Our course fees include coffee breaks (in classroom courses only).
For lists of EU, EU-associated, and PRACE countries, see the Horizon Europe and PRACE websites.
Lucas Jordan, phone 0711 685 87206, training(at)hlrs.de
HLRS is part of the Gauss Centre for Supercomputing (GCS), together with JSC in Jülich and LRZ in Garching near Munich. EuroCC@GCS is the German National Competence Centre (NCC) for High-Performance Computing. HLRS is also a member of the Baden-Württemberg initiative bwHPC. Since 2025, HLRS has coordinated HammerHAI.
This course is provided within the framework of the bwHPC training program.
See the training overview and the Supercomputing Academy pages.
See also information about the HLRS training department and staff.