When should we expect the learned algorithm to be a mesa-optimiser?

Questions: