Wednesday 23rd November 2022
Title: Apache Spark Programming with Databricks
Prerequisites:
Skills – Beginner level SQL or Python, no previous Spark or Databricks experience required.
Tech – Laptop, computer or tablet with keyboard and stable internet. Up to date versions of chrome / safari / firefox. Edge will see some performance degradation, IE and mobile browsers not supported.
Overview:
This course is driven by an e-Commerce case study that explores the fundamentals of Spark Programming with Databricks. You will start by identifying the major components of the Databricks and Spark ecosystem, and exploring data in the Databricks environment. You will then demonstrate core Spark SQL concepts and learn to navigate the DataFrame API. After ingesting data from various formats, you will process and analyze datasets by applying a variety of DataFrame transformations and Column expressions. Lastly, you will process different types of data using specialized sets of built-in functions.
Agenda:
Course Timings – 9:30am – 4:30pm GMT
Q&A – 4:30pm – 5:00pm GMT
Lunch is at 12pm GMT, 7 mins breaks at roughly the top of each hour