Zoom into the kernel

The core ideas: task dependency analysis

Given all existing inputs, the analysis runs in two passes:

  1. build a graph of all possible flows
  2. trim the tasks that are not required

[Figure panels: full dependency graph; trimmed dependency graph]

In the end, we have a DAG that we can pass to Dask.
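
The two passes can be sketched roughly as follows. This is a minimal illustration, not the kernel's actual code: the task table `TASKS`, the helpers `build_full_graph`, `trim` and `to_dask_graph`, and the toy inputs are all hypothetical names made up for the example. The only thing taken from Dask is its plain-dict graph format, where a key maps either to a literal value or to a tuple `(callable, *args)`.

```python
from operator import add, mul

# Hypothetical task table: output name -> (callable, names of required inputs).
TASKS = {
    "c": (add, ("a", "b")),
    "d": (mul, ("c", "a")),
    "e": (add, ("c", "c")),
    "f": (add, ("x", "a")),  # "x" is never available, so "f" can never run
}


def build_full_graph(tasks, available):
    """Pass 1: collect every task that can run, directly or transitively,
    starting from the inputs that already exist."""
    known = set(available)
    graph = {}
    changed = True
    while changed:  # repeat until no new task becomes runnable
        changed = False
        for out, (func, deps) in tasks.items():
            if out not in graph and all(d in known for d in deps):
                graph[out] = (func, deps)
                known.add(out)
                changed = True
    return graph


def trim(graph, targets):
    """Pass 2: walk backwards from the requested targets and keep only
    the tasks they depend on."""
    needed = {}
    stack = list(targets)
    while stack:
        name = stack.pop()
        if name in graph and name not in needed:
            needed[name] = graph[name]
            stack.extend(graph[name][1])
    return needed


def to_dask_graph(graph, inputs):
    """Emit a dict in Dask's graph format: key -> value or (callable, *args)."""
    used = {dep for _, deps in graph.values() for dep in deps}
    dsk = {name: value for name, value in inputs.items() if name in used}
    for out, (func, deps) in graph.items():
        dsk[out] = (func, *deps)
    return dsk


inputs = {"a": 2, "b": 3}                 # assumed existing inputs
full = build_full_graph(TASKS, inputs)    # tasks c, d, e ("f" is unreachable)
trimmed = trim(full, ["d"])               # only c and d are needed for "d"
dsk = to_dask_graph(trimmed, inputs)
```

With Dask installed, a dict like `dsk` can be handed to a scheduler, for example `dask.get(dsk, "d")`. The forward pass is a simple fixed-point loop chosen for readability; a real implementation would likely index tasks by input to avoid rescanning the whole table.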