Supplemental Materials: High Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization. Additional Experiment Results
Neural Information Processing Systems
Fig. S2 shows very different policy patterns and optimums for the two contexts (darker red means higher
Aug-17-2025, 09:18:12 GMT