A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management