Benchmarking Multi-Agent Preference-based Reinforcement Learning for Human-AI Teaming