Safe and Sample-efficient Reinforcement Learning for Clustered Dynamic Environments