Parametrized Quantum Policies for Reinforcement Learning