A Experimental Details
Neural Information Processing Systems
A.1 Environments and Tasks
We provide the details about the environments and tasks used in our experiments in Table 1.
Table 1: Environments and tasks from the DeepMind control suite [29] used in our experiments.
Near-expert data: Same as the above near-expert dataset, but we only include 2M steps experience (2K episodes in total) for each task.
Goal-MLP Training
We adapt the training of Goal-MLP to make it learn to reach goals with varying time budgets.
MaskDP is designed to be accessible to the RL research community.
Nov-14-2025, 06:41:00 GMT