Webinar
29.04.2025
Executing and debugging Multi-GPU Codes on Alps
The Swiss National Supercomputing Centre (ETH Zurich / CSCS) is pleased to announce the webinar “Executing and debugging Multi-GPU Codes on Alps,” to be held on Tuesday, April 29, 2025, from 11:00 to 12:00 CEST.
Content
This webinar provides existing and new users with best practices for executing, debugging, and analyzing multi-GPU codes on the CSCS Alps supercomputer. The first part covers key concepts such as NUMA (Non-Uniform Memory Access), hardware topology, expected FLOPS (floating-point operations per second), and network performance. It also addresses practical aspects of device visibility, Slurm configuration, and the use of wrapper scripts to optimize multi-GPU workloads. The second part introduces parallel debugging (Linaro) and performance measurement (NVIDIA, VI-HPS) tools and explores how they can help users make the most of the Alps supercomputer.
Trainers
Daniel Ganellari (Research Software Engineer, CSCS)
Jean-Guillaume Piccinali (Senior HPC Performance Engineer, CSCS)
Target audience
Members of the CSCS User Community
Date and time
April 29, 2025, from 11:00 to 12:00 CEST
Format and timeframe
1-hour webinar via the Zoom videoconferencing system
Connection details
https://ethz.zoom.us/j/65461918956
A video recording will be made available after the webinar.
We look forward to connecting with you on Tuesday, April 29, 2025!