Provably Efficient Offline Multi-agent Reinforcement Learning via Strategy-wise Bonus

Open in new window