Remedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling