Inference of Deterministic Finite Automata via Q-Learning