High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Open in new window