Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation

Jun-14-2026, 04:37:45 GMT–Neural Information Processing Systems

Reliable uncertainty quantification is crucial for reinforcement learning (RL) in high-stakes settings. We propose a unified conformal prediction framework for infinite-horizon policy evaluation that constructs distribution-free prediction intervals for returns in both on-policy and off-policy settings.

artificial intelligence, machine learning, proceedings, (2 more...)

Neural Information Processing Systems

Jun-14-2026, 04:37:45 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.41)