R on HPC - Introduction
NHR-Tutorial (Online)
Monday, 07/11/2022, 10:00 am - 3:00 pm
Vortragende: Neringa Jurenaite, Iryna Okhrin, Taras Lazariv
In this tutorial, we will introduce R users to the advantages of working on R on a High Performance Computing cluster. We will provide an overview of the most common Machine Learning methods and then look into how exactly we can explore their parallelization for the purposes of speeding up the run time. We will also show how some of the benchmarking packages in R work. In the end, the participants will have the opportunity to do it all themselves in the Hands-on Session.
Agenda
-
Accessing R and RStudio on our HPC system
-
Overview of some of the main Machine Learning models (e.g. Linear and Logistic regression, Random Forest, etc.)
-
Introduction to model benchmarking in R
-
Introduction to parallelization in R: data-based and model-based
-
Hands-on Session: Exercises
Handouts
The course material (slides, sample application) will be available.
Pre-Knowledge
- Understanding of ML methods
- Basic experience in using R
- We recommend attending ML-HPC-B NHR Tutorial in advance or familiarize with Taurus and its compendium page
Post-Knowledge
- Application of main ML methods in R and awareness of corresponding issues
- Implementation of parallelization and benchmarking of ML models in R on an HPC cluster using specific examples
HPC-Certification Forum Links
The following links show the skill descriptions that should be taught in the respective course.
Registrierung
Link: https://event.zih.tu-dresden.de/nhr/r-hpc
Registration is closing on 06/24/2022.
You will receive the access data shortly before the event by email to your registered email address.
Further Information
Course language: English
Target group: HPC Basics, R User, ML model User
If you have any further questions, please contact Anja Gerbes ().