High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization