SuperHF: Supervised Iterative Learning from Human Feedback