Bradley-Terry and Multi-Objective Reward Modeling Are Complementary