To get started running Dask on common Cloud providers like Amazon, Google, or Microsoft, we currently recommend deploying Dask with Kubernetes and Helm.
All three major cloud vendors now provide managed Kubernetes services. This allows us to reliably provide the same experience across all clouds, and ensures that solutions for any one provider remain up-to-date.
Alternatively, if you are deploying on a cloud-hosted Hadoop cluster like Amazon EMR or Google Cloud DataProc, you will want to use Dask-Yarn. Documentation on deploying on Amazon EMR specifically can be found here, the process is similar for Google Cloud DataProc.
You may want to install additional libraries in your Jupyter and worker images to access the object stores of each cloud: