Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning

Aug-19-2025, 19:18:12 GMT–Neural Information Processing Systems

On-policy algorithms learn about a particular target policy using data collected by behaving according to the target policy.

machine learning, reinforcement learning, trajectory, (13 more...)

Neural Information Processing Systems

Aug-19-2025, 19:18:12 GMT

Conferences PDF

Country:
- North America > United States
  - Maryland (0.04)
  - Wisconsin > Dane County
    - Madison (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report > Experimental Study (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (1.00)