On Reward-Free Reinforcement Learning with Linear Function Approximation

May-27-2025, 12:16:33 GMT–Neural Information Processing Systems

Reward-free reinforcement learning (RL) is a framework which is suitable for both the batch RL setting and the setting where there are many reward functions of interest. During the exploration phase, an agent collects samples without using a pre-specified reward function. After the exploration phase, a reward function is given, and the agent uses samples collected during the exploration phase to compute a near-optimal policy. Jin et al. [2020] showed that in the tabular setting, the agent only needs to collect polynomial number of samples (in terms of the number states, the number of actions, and the planning horizon) for reward-free RL. However, in practice, the number of states and actions can be large, and thus function approximation schemes are required for generalization.

artificial intelligence, machine learning, reinforcement learning, (11 more...)

Neural Information Processing Systems

May-27-2025, 12:16:33 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Reinforcement Learning (0.64)
  - Representation & Reasoning > Uncertainty
    - Fuzzy Logic (0.68)