Dynamic Inference with Neural Interpreters

Neural Information Processing Systems

Modern neural network architectures can leverage large amounts of data to generalize well within the training distribution.