Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies