Feb 03, 2022
Living Lab Lecture SeriesLiving Lab Lecture No. 7: Big Data Performance Analysis
(ScaDS.AI - Center for Scalable Data Analytics and Artificial Intelligence Leipzig/Dresden)
On 3rd February 2022 at 11:00 am our 7th Living Lab Lecture will take place. This time Jan Frenzel will talk about "Big Data Perdormance Analysis".
In the last years, the amount of data that needs to be processed has increased tremendously. Java-based frameworks, such as Apache Hadoop, Apache Spark and Apache Flink have been developed to simplify the work with distributed data by hiding much of the complexity related to distributed data processing, such as splitting data or moving data in the compute cluster, behind functional building blocks.
However, because of this hidden complexity, performance analysis of applications written with these frameworks is particularly challenging. The performance could be limited by the application, the framework itself or the framework’s configuration. Different approaches could be used to investigate these potential causes of low performance.
Our ScaDS.AI scientific researcher Jan Frenzel introduces you to the area of performance evaluation and performance investigation of these frameworks. He presents benefits of using an established performance analysis tool: Vampir, as an alternative to the dashboards of Apache Spark and Apache Flink.
The Living Lab Lecture Series is free for everyone.
Language: English
To join the lecture please click here: Living Lab Lecture #7