PoolFlip: A Multi-Agent Reinforcement Learning Security Environment for Cyber Defense