Advancing Translation Preference Modeling with RLHF: A Step Towards Cost-Effective Solution