Semi-Parametric Efficient Policy Learning with Continuous Actions