Sep 27, 2022
NHR-TrainingsangeboteML-HPC-B-Tutorial: Machine Learning on HPC - Introduction
Peter Winkler (ZIH),
Wenyu Zhang (ZIH)
Due to the heterogeneity of ML applications, the motivation to switch to an HPC system can be manifold, e.g. due to large memory requirements, GPU usage or increase of computation speed. The course presents how a typical ML workflow can be realized in the HPC environment. It is possible to switch to the HPC system at different points in the workflow - depending on the requirements. The development of ML applications is often done by collaborative work within groups, which is also taken into account in the implementation of the ML workflow.
Agenda:
- Access to the HPC system (e.g. ssh, Jupyterhub)
- Data transfer and storage of training data, models, source codes etc. (e.g. scp, dtcp, user space, workspaces)
- Setup of the required software environment (e.g. using module system, virtual environments, containers)
- Execution/testing/debugging of applications (e.g. batch jobs, interactive jobs)
- Evaluation and storage of results
-
simple monitoring to optimize applications (Pika)
The course is free of charge. Course language: English.
Please register until 16th September 2022 here: Registration ML-HPC-B
You will receive the access data via email to your registered email address before the event. If you have any questions, please do not hesitate to contact Ms. Anja Gerbes ().