Solving Continuous Control via Q-learning