April 22, 2025 - by CSCS
This webinar aims to provide existing and new users with best practices for executing, debugging and analyzing multi-GPU codes on the CSCS Alps supercomputer.
The first part will cover key concepts such as NUMA (Non-Uniform Memory Access), hardware topology, expected FLOPS (floating-point operations per second), and network performance. It will also address practical aspects of device visibility, SLURM configuration, and the use of wrapper scripts to optimize multi-GPU workloads.
The second part will introduce tools for parallel debugging (LINARO) and performance measurement (NVIDIA, VI-HPS), and show how they can help users make the most of the Alps supercomputer.
For more information, please visit the event page.
To join the webinar, please use the Zoom link.
We look forward to connecting with you on Thursday, May 15, 2025!