Deterministic Uncertainty Propagation for Improved Model-Based Offline Reinforcement Learning

Oct-10-2025, 07:44:20 GMT–Neural Information Processing Systems

We present a theoretical result demonstrating the strong dependency of suboptimality on the number of Monte Carlo samples taken per Bellman target calculation.

dataset, offline reinforcement, reinforcement, (14 more...)

Neural Information Processing Systems

Oct-10-2025, 07:44:20 GMT

Conferences PDF

Country:
- Europe > Denmark
  - Southern Denmark (0.04)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Uncertainty
    - Bayesian Inference (0.46)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (0.45)
    - Learning Graphical Models > Directed Networks
      - Bayesian Learning (0.46)

Duplicate Docs Excel Report

Title
82240d93542b74d0c4fdffca39cb779f-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found