ARE: Scaling Up Agent Environments and Evaluations