Simulation-Based Benchmarking of Reinforcement Learning Agents for Personalized Retail Promotions