A Additional experimental details

Neural Information Processing Systems 

RBF kernel to increase pretraining data diversity. Architectural details In all experiments, we use the same ExPT architecture. This section details how we constructed new objectives from the original D'Kitty and Ant that we In Ant-Energy, the reward at each time step is: R =1+ Survival reward Control cost Contact cost, (6) which means we incentivize the robot to conserve energy instead of running fast. D'Kitty tasks In D'Kitty, the goal is to design a morphology that allows the D'Kitty robot to reach We found the approximate oracle provided by Design-Bench not accurate enough to provide a reliable comparison of optimization methods on this task. C.1 Effects of GP hyperparameters We empirically examine the impact of two GP hyperparameters, the variance and the length scale ` Specifically, we evaluate the performance of ExPT on D'Kitty We average the performance across 3 seeds.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found