Sample-Efficient Preference-based Reinforcement Learning with Dynamics Aware Rewards