Add no-op versions for DMA/Scratch
Comparing the same algorithm on two architectures, one with a deep memory hierarchy and one without, is currently difficult. This is because scratch operations still trigger memory movements even when both sides refer to the same memory area.
It would be helpful to have a scratchpad variant that doesn't trigger any DMAs. Similarly, it should be possible to provide a DMA variant that doesn't actually move any data.
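
To make the request concrete, here is a minimal sketch of what the no-op variants could look like. It is not based on the project's actual API: `CopyingDma`, `NoOpDma`, `Scratch`, `NoOpScratch` and `increment_all` are all hypothetical names, and the byte counter exists only to show whether any data was moved. The idea is that the algorithm is written once and the memory behaviour is chosen by swapping in a different class.

```python
class CopyingDma:
    """Hypothetical DMA engine: copies src into dst and counts bytes moved."""

    def __init__(self):
        self.bytes_moved = 0

    def transfer(self, src, dst):
        dst[:] = src
        self.bytes_moved += len(src)


class NoOpDma(CopyingDma):
    """Same interface, but moves nothing. Only meaningful when src and dst
    already alias the same memory (assumption of this sketch)."""

    def transfer(self, src, dst):
        pass


class Scratch:
    """Hypothetical scratchpad: stages data in a local buffer via DMA."""

    def __init__(self, backing, dma):
        self.backing = backing
        self.dma = dma
        self.local = bytearray(len(backing))

    def __enter__(self):
        self.dma.transfer(self.backing, self.local)  # copy in
        return self.local

    def __exit__(self, *exc):
        self.dma.transfer(self.local, self.backing)  # copy back
        return False


class NoOpScratch:
    """Same interface, but exposes the backing memory directly: no DMAs."""

    def __init__(self, backing, dma=None):
        self.backing = backing

    def __enter__(self):
        return self.backing

    def __exit__(self, *exc):
        return False


def increment_all(scratch_cls, data, dma):
    """The algorithm under comparison, identical for both architectures."""
    with scratch_cls(data, dma) as buf:
        for i in range(len(buf)):
            buf[i] = (buf[i] + 1) % 256
    return data


deep_dma, flat_dma = CopyingDma(), CopyingDma()
deep = increment_all(Scratch, bytearray([1, 2, 3]), deep_dma)
flat = increment_all(NoOpScratch, bytearray([1, 2, 3]), flat_dma)
```

In this sketch both runs produce the same result, but only the `Scratch` run records any transferred bytes, which is the kind of like-for-like comparison the issue asks for.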