Exponentially Weighted Imitation Learning for Batched Historical Data

Qing Wang, Jiechao Xiong, Lei Han, Peng Sun, Han Liu, Tong Zhang

Feb-12-2026, 18:47:41 GMT–Neural Information Processing Systems

We consider deep policy learning with only batched historical trajectories. The main challenge is that, unlike in most reinforcement learning settings, the learner has no access to a simulator or "environment oracle".

artificial intelligence, machine learning, reinforcement learning

Country:
- North America > Canada > Quebec > Montreal (0.04)

Industry:
- Leisure & Entertainment (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)