Preference Learning from Physics-Based Feedback: Tuning Language Models to Design BCC/B2 Superalloys