ProvablyFeedback-EfficientReinforcementLearning viaActiveRewardLearning

Feb-8-2026, 17:14:21 GMT–Neural Information Processing Systems

Here H is the horizon oftheRL environment, anddimR specifies thecomplexity ofthefunction class representing the reward function.

machine learning, pmlr, reinforcement learning, (15 more...)

Neural Information Processing Systems

Feb-8-2026, 17:14:21 GMT

Conferences PDF

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.49)

Duplicate Docs Excel Report

Title
Provably Feedback-Efficient Reinforcement Learning via Active Reward Learning

Similar Docs Excel Report more

Title	Similarity	Source
None found