PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent Tasks