Imitation-Projected Policy Gradient for Programmatic Reinforcement Learning