DRO-REBEL: Distributionally Robust Relative-Reward Regression for Fast and Efficient LLM Alignment