Applied Data Science with PySpark¶
Agenda¶
The agenda for this will be
- Introduction to me and you
- Highlevel view of the course
- Course Material
Instructor : Troels Lægsgaard¶
Mar 2023 - Present: Data Scientist at InCommodities
Oct 2020 - Mar 2023: Data Scientist at Terma
Sep 2018 - Oct 2020: Data Scientist at A.P. Moller–Maersk
Sep 2017 - Sep 2018: Data Scientist at Danske Bank
Apr 2016 - Dec 2016: Temporary/student assistant at Sydbank
Jan 2014 - Jun 2016: Teaching Assistant at Aarhus University (5 courses, 2 years 6 months)
Master degree, Mathematical Investment, cand.scient.oecon
Highlevel view of the course¶
The goal of this two day course is to step through some basic tools for tackling a wide variety of Data Science challenges.
Highlevel view of the course¶
Topics of the course:
- Databricks Notebooks
- Spark
- Spark SQL
- Spark ML
The weight will be on what is applied in a Data Scientist position.
Books¶
Structure of the Course¶
From 9:00 to 16:00 each day, we might break early if we're fast.
Coffea breaks at 10:00-10:15 and 14:00-14:15
Lunch break at 12:00-12:30.
Structure of the Course¶
During the lectures we will:
Go through some Theory.
Make some Exercises.
Repeat.
Day 2 will start with Q&A to questions I couldn't answer on the spot.
When the course is finished, I will put up a curricilum and Q&A on OneDrive which you can access from home.