Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

When we dispatch a kernel to the GPU, we need to finish its the execution of all the workitems even if one or more of the workitems deoptimize.  The kernel code can be executed across a possibly very large number of workitems, each of which can have its own state.  The non-deoptimizing workitems can finish as they normally would but we need to be able to save the state of the deoptimizing workitems.   When we dispatch a kernel, we We don't know how many workitems are going to need to deoptimize and need to save their state.  So how do we , yet we want to avoid having to allocate state-saving space for the entire possibly very large range of workitems.

...