Learning Collusion in Episodic, Inventory-Constrained Markets