Spark For Data Engineers

Learn Data Engineering Using Open Source Spark

Apache Spark is a widely used open source engine for large-scale data analytics. In this course, go from beginner to implementing advanced data pipelines with clean, maintainable Spark code.

#1

Introduction To Spark

In this lesson we will introduce Apache Spark and cover some of the core concepts and use cases.

#2

Loading Static Files

In this lesson we will describe how to load data from static CSV and JSON files into Spark.

#3

Group By and Aggregation

In this lesson we will learn about basic group by and aggregation operations in Spark.

#4

Joins

In this lesson we will learn how to perform joins in Apache Spark.

#5

Spark Query Plan

In this lesson we will learn about the Spark query plan and how to interpret it.

#6

From Spark To Databricks

In this lesson we will learn how Spark and Databricks differ, and the advantages of moving from Spark to Databricks.


© 2022 Timeflow Academy. Brought To You By Timeflow.