29.04.2025

Executing and debugging Multi-GPU Codes on Alps

Webinar
Free

The Swiss National Supercomputing Centre (ETH Zurich / CSCS) is pleased to announce the webinar “Executing and debugging Multi-GPU Codes on Alps,” to be held on Tuesday, April 29, 2025, from 11:00 to 12:00 CEST.

Content

This webinar aims to provide existing and new users with best practices for executing, debugging and analyzing multi-GPU codes on the CSCS Alps supercomputer. The first part will focus on covering key concepts such as NUMA (Non-Uniform Memory Access), hardware topology, expected FLOPs (Floating Point Operations per Second), and network performance. It will also include practical aspects of device visibility, SLURM configuration, and the use of wrapper scripts to optimize multi-GPU workloads. The second part will introduce parallel debugging (LINARO) and performance measurement (NVIDIA, VI-HPS) tools, exploring how these tools can assist users in maximizing their utilization of the Alps supercomputer.

Trainers

Daniel Ganellari (Research Software Engineer, CSCS)

Jean-Guillaume Piccinali (Senior HPC Performance Engineer, CSCS)

Target audience

Members of the CSCS User Community

Date and time

April 29, 2025, from 11:00 to 12:00 CEST

Format and timeframe

1h webinar via Zoom videocoference system

Connection details

 https://ethz.zoom.us/j/65461918956

*****************************************

A video recording will be made available after the webinar.

We look forward to connecting with you on Tuesday, April 29, 2025!