Reward Models Identify Consistency, Not Causality