1a669e81c8093745261889539694be7f-Supplemental.pdf

Feb-7-2026, 16:04:14 GMT–Neural Information Processing Systems

Ifweassumethereward function is a linear combination of features, it is often the case that the number of featuresk is much lessthanthetotalnumber ofstate-action pairs. When learning a posterior from demonstrations we use Bayesian IRL [4]. Bayesian IRL uses Markov chain Monte Carlo (MCMC) sampling to sample from the posterior P(R|D). The step size was tuned to result in an accept ratio close to0.4. Ifso, then we stop gradient ascent.

artificial intelligence, demonstration, machine learning, (11 more...)

Neural Information Processing Systems

Feb-7-2026, 16:04:14 GMT

Conferences PDF

Add feedback

Country:
- North America
  - United States > Illinois
    - Cook County > Chicago (0.05)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.05)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Duplicate Docs Excel Report

Title
Supplementary Materials for Bayesian Robust Optimization for Imitation Learning Daniel S. Brown

Similar Docs Excel Report more

Title	Similarity	Source
None found