Spark For Data Engineers

Learn Data Engineering Using Open Source Spark

In this hands-on training course we will learn how to analyse data and implement advanced data pipelines by deploying clean, maintainable Spark code.

Hands-On Lab

Introduction To Apache Spark

Introducing Apache Spark and cover some of the core concepts and use cases.

View
Hands-On Lab

Loading Static Files

Introducing Apache Spark and describe how to load data from static CSV and JSON files.

View
Hands-On Lab

Group By and Aggregation

Group by and aggregation operations in Spark.

View
Hands-On Lab

Joins

How to perform joins in Apache Spark.

View
Hands-On Lab

Spark Query Plan

The Spark query plan and how to interpret it.

View
Hands-On Lab

From Spark To Databricks

Introduction Spark is the leading open source platform for processing and working with big data. Because Spark is relatively complex to…

View

© 2023 Timeflow Academy.