Local policy search with Bayesian optimization Sarah Müller
–Neural Information Processing Systems
Nevertheless, instead of systematically reasoning and actively choosing informative samples, policy gradients for local search are often obtained from random perturbations. These random samples yield high variance estimates and hence are sub-optimal in terms of sample complexity.
Neural Information Processing Systems
Oct-9-2025, 16:09:35 GMT
- Country:
- Europe > Germany
- Baden-Württemberg
- Stuttgart Region > Stuttgart (0.04)
- Tübingen Region > Tübingen (0.14)
- Baden-Württemberg
- North America
- Canada > Ontario
- Toronto (0.04)
- United States
- California (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Canada > Ontario
- Europe > Germany
- Industry:
- Government (0.46)
- Technology: