Strategy Masking: A Method for Guardrails in Value-based Reinforcement Learning Agents