Local policy search with Bayesian optimization Sarah Müller

Oct-9-2025, 16:09:35 GMT–Neural Information Processing Systems

Nevertheless, instead of systematically reasoning and actively choosing informative samples, policy gradients for local search are often obtained from random perturbations. These random samples yield high variance estimates and hence are sub-optimal in terms of sample complexity.

artificial intelligence, machine learning, optimization, (15 more...)

Neural Information Processing Systems

Oct-9-2025, 16:09:35 GMT

Conferences PDF

Add feedback

Country:
- North America
  - United States
    - California (0.04)
    - Georgia > Fulton County
      - Atlanta (0.04)
  - Canada > Ontario
    - Toronto (0.04)
- Europe > Germany
  - Baden-Württemberg
    - Tübingen Region > Tübingen (0.14)
    - Stuttgart Region > Stuttgart (0.04)

Industry:
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Statistical Learning (1.00)
  - Representation & Reasoning
    - Search (1.00)
    - Optimization (1.00)

Duplicate Docs Excel Report

Title
ad0f7a25211abc3889cb0f420c85e671-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found