Dealing with Non-Stationarity in Multi-Agent Deep Reinforcement Learning