Reward Models Identify Consistency, Not Causality

Open in new window