Supplementary Materials for Bayesian Robust Optimization for Imitation Learning Daniel S. Brown
–Neural Information Processing Systems
When using the robust performance metric described in Section 4.2, we have We solve the above linear program to obtain the results presented in Section 5.1. Work done while at UT Austin. We use Scipy's linear programming software (v 1.4.1) MDP is solved to obtain the sample's likelihood and determine the transition probabilities within the Markov chain. We used a learning rate of 0.01.
Neural Information Processing Systems
Oct-2-2025, 07:42:55 GMT
- Country:
- North America
- Canada (0.04)
- United States
- Illinois > Cook County
- Chicago (0.04)
- New Hampshire (0.04)
- Texas > Travis County
- Austin (0.04)
- Illinois > Cook County
- North America