Use the Online Network If You Can: Towards Fast and Stable Reinforcement Learning