MARPLE: A Benchmark for Long-Horizon Inference Emily Jin Zhuoyi Huang

Open in new window