Automaton Constrained Q-Learning