A Environment Details

Neural Information Processing Systems 

Our unsupervised pre-training algorithm is provided in Algorithm 1. We assume that the pre-training environment provides access to both proprioceptive states (the input of the skill policy) and goal state features as defined in Appendix B. During training, goal spaces and goals are randomly selected for each episode The low-level skill policy

Similar Docs  Excel Report  more

TitleSimilaritySource
None found