While Dask base images come with the full set of Dask collection components and common dependencies like Pandas and NumPy, you might still have to modify your environments if you need additional packages or when you need a specific version of a given package.
Domino allows you to easily package and manage dependencies as part of your Dask-enabled compute environments. This approach creates the flexibility to manage dependencies for individual projects or workloads without having to deal with the complexity of a shared cluster.
To add a new dependency, you need to add the appropriate statements in the Docker instructions section of the relevant Dask base and Dask execution compute environments.
For example, if you wanted to add a particular version of Dask ML, you might include the following.
### Optionally specify version if desired
RUN pip install dask-ml==1.9.0
There are several optional dependencies that might be required for subsets of Dask functionality. You can find more information in the Dask documentation.