Dask is a relatively new library for parallel computing in Python. It builds on data structures familiar to users of the PyData stack and lets them scale their work across one or many machines. This tutorial will introduce users to the core concepts of dask by working through example problems. The tutorial will be distributed as Jupyter Notebooks.
The tutorial will introduce users to the core concepts of dask including
The tutorial will also introduce the core data structures including
We will make use of both real and generated data sets to learn and reinforce these concepts.