Phase-Parametric Policies for Reinforcement Learning in Cyclic Environments