AMD Instinct™ GPU Training

This course will be provided ONLINE via Zoom.

This course will deep dive into the AMD Instinct™ GPU architecture and its ROCm™ ecosystem, including the tools to port or/and migrate their application (HPC/AI) on AMD GPU through HIP (Heterogeneous-computing Interface for Portability) as run them in a multi-node environment. An introduction to RocOps, the AMD software stack for AI, will be provided to attendees, including optimized execution engine for deep learning neural networks.

After this course, participants will

  • have gained knowledge about software enablement on AMD GPU using HIP
  • be able to port simple code from CUDA to HIP
  • have gained knowledge about optimized execution engine for deep learning neural networks.

Veranstaltungsort

Online course
Organizer: HLRS, University of Stuttgart, Germany

Veranstaltungsbeginn

29. Sep 2022
08:45

Verstaltungsende

30. Sep 2022
16:15

Sprache

Englisch

Einstiegslevel

Advanced

Themen

Code Optimization

Deep Learning

GPU Programming

Machine Learning

Zurück zur Liste

Prerequisites and content levels

Prerequisites

Some knowledge in GPU programming, e.g., CUDA, and AI/Deep Learning

See also the suggested prereading below (resources and public videos).

Content levels

Basic: 1 hours
Intermediate: 4 hours
Advanced: 6 hours

Learn more about course curricula and content levels.

Resources
Public videos

Instructors

Essam Morsi (AMD), Adil Lashab (AMD)

Agenda

All times are CEST.
Day 1

08:45 - 9:00 Drop in to Zoom

  • 9:00 - 10:30 Introduction AMD GPU
    • GCN/CDNA Overview
    • Memory Hierarchy
    • HIP GPU Compute Terminology
    • Compute Units
  • 10:45 - 12:45 HIP
    • Introduction to HIP & Core HIP API
    • Memory Management in HIP
    • Asynchronous Computing with HIP
    • Tips & Tricks
  • 12:45 - 13:45 Lunch Break
  • 13:45 - 15:15  ROCm
    • Introduction to ROCm
    • Multi-GPU RCCL/MPI with ROCm
    • Debugging
    • Profiling
  • 15:30 - 16:15  AI
    • RocOps introduction (AMD AI SW Stack)
    • Training
      • Single GPU training with TF and PY for all models
      • mGPU training with TF and PY
      • Distributed mNode training with TF and PY
    • Inference
      • MLPerf MIGraphX & TVM Backend
Day 2 (Hands-on)

08:45 - 9:00 Login and Microsoft Teams setup

  • 9:00 - 10:30 ROCm setup
  • 10:00 - 12:00 Hipification: CUDA To HIP
    • Square
    • RTM TTI
  • 12:00 - 13:00 Lunch Break
  • 13:00 - 14:30 HIP
    • Profiling HW counters 
    • Example with bank conflicts on shared memory / how to profile / resolve it
    • RocProfiler RocTracer ISA code
  • 14:45 - 16:00 AI
    • Build optimized execution program for DLNN ?
  • 16:00 - 16:15 QA

Registration information

This course is already fully booked.

Fees

This course is free of charge.

Contact

Khatuna Kakhiani phone 0711 685 65796, kakhiani(at)hlrs.de
 

PRACE PATC and bwHPC

HLRS is part of the Gauss Centre for Supercomputing (GCS), which is one of the six PRACE Advanced Training Centres (PATCs) that started in Feb. 2012.

HLRS is also member of the Baden-Württemberg initiative bwHPC.

This course is also provided within the framework of the bwHPC training program. This course is not part of the PATC curriculum and is not sponsored by the PATC program.

Further courses

See the training overview and the Supercomputing Academy pages.

Ähnliche Trainingskurse

Alle Trainingskurse

September 12 - Oktober 24, 2022

Online


September 19 - Oktober 24, 2022

Online


Oktober 24 - 25, 2022

Online


November 07 - 11, 2022

Hybrid Event - Stuttgart, Germany


November 14 - 17, 2022

Online


Dezember 05 - 09, 2022

Online