top of page

Learning Apache Spark Quick Start

Short facts

Duration

Course level

Last update

3 hours

All-Levels

December 9, 2020 at 11:00:00 PM

Requirements

Python, SQL Basics, Docker Basics

Description

Apache Spark quick start course in Python with Jupyter notebooks, DataFrames, SparkSQL and RDDs

What will I learn?

You will learn the fundamentals how Spark works and Spark's architecture. You will be working with Jupyter notebooks on Docker. You train Spark transformations and actions, work with SparkSQL on JSON and CSV files. After this course you have all the fundamental knowledge to write your own jobs

Price

Included in Academy Subscription

Visit our new Data Engineering Academy at

learndataengineering.com

What's included?

Videos

Source Codes

Example Data

PDF Presentation

Course Content

  1. Why Spark

  2. How Spark Works

  3. Dev Environment Docker & Jupyter

  4. Working with DataFrames

  5. Introduction into SparkSQL

  6. Coding With RDDs

  7. Conclusion

Need Something More Personalized?
Check out the Coaching Program:
bottom of page