Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model