Supplemental Materials

High Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Additional Experiment Results