ProvablyEfficientCausalReinforcementLearning withConfoundedObservationalData

Feb-10-2026, 17:59:49 GMT–Neural Information Processing Systems

Empowered by neural networks, deep reinforcement learning (DRL) achieves tremendous empirical success. However, DRL requires a large dataset by interacting with the environment, which is unrealistic in critical scenarios such as autonomous driving and personalized medicine. In this paper, we study how to incorporate the dataset collected in the offline setting to improve the sample efficiency in the online setting. To incorporate the observational data, we face two challenges.

artificial intelligence, machine learning, reinforcement learning, (19 more...)

Neural Information Processing Systems

Feb-10-2026, 17:59:49 GMT

Conferences PDF

Add feedback

Country:
- Asia > Middle East > Jordan (0.04)

Industry:
- Health & Medicine (0.67)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (0.46)

Duplicate Docs Excel Report

Title
b0b79da57b95837f14be95aaa4d54cf8-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found