Skip to content

Latest commit

 

History

History
19 lines (13 loc) · 794 Bytes

README.md

File metadata and controls

19 lines (13 loc) · 794 Bytes

RMACC Dask Tutorial 2021

Binder

Dask provides advanced parallelism for analytics, enabling performance at scale for the tools you love.

In this 90 minute tutorial we will cover an overview of Dask including dataframes, arrays, and distributed scheduling. We will also do some lightweight analysis of a 16 TB dataset of sea surface temperatures.

Schedule

These are the rough timings for the tutorial.

  • Overview of Dask with Dask Dataframe (10 mins)
  • Introductory Lab (10 mins) and results (5 mins)
  • Dask GUI and dashboards (10 mins)
  • Dask Array (10 mins)
  • Distributed (10 mins)
  • Analysis of sea surface temperature (15 mins)
  • Wrapup and close (5 mins)