Interaction Dynamics as a Reward Signal for LLMs
