Offline Imitation Learning from Multiple Baselines with Applications to Compiler Optimization