Learning to Coordinate Under Threshold Rewards: A Cooperative Multi-Agent Bandit Framework