Nimble: Lightweight and Parallel GPU Task Scheduling for Deep Learning

Neural Information Processing Systems 

Nimble introduces a novel technique called ahead-of-time (AoT) scheduling.