Safe Reinforcement Learning via Shielding