A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications