A Environment Details
–Neural Information Processing Systems
Our unsupervised pre-training algorithm is provided in Algorithm 1. We assume that the pre-training environment provides access to both proprioceptive states (the input of the skill policy) and goal state features as defined in Appendix B. During training, goal spaces and goals are randomly selected for each episode The low-level skill policy
Neural Information Processing Systems
May-29-2025, 00:29:37 GMT