RLAF: Reinforcement Learning from Automaton Feedback
