Pref-GUIDE: Continual Policy Learning from Real-Time Human Feedback via Preference-Based Learning