OnReward-FreeReinforcementLearningwith LinearFunctionApproximation

Feb-10-2026, 11:12:37 GMT–Neural Information Processing Systems

During the exploration phase, an agent collects samples without using a pre-specified reward function. After the exploration phase, a reward function is given, and the agent uses samples collected during the exploration phase to computeanear-optimalpolicy.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Feb-10-2026, 11:12:37 GMT

Conferences PDF

Add feedback

Country:
- Europe > United Kingdom (0.04)
- North America
  - United States
    - Pennsylvania > Allegheny County
      - Pittsburgh (0.04)
    - Massachusetts > Middlesex County
      - Belmont (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Asia > Middle East
  - Jordan (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.31)

Duplicate Docs Excel Report

Title
On Reward-Free Reinforcement Learning with Linear Function Approximation

Similar Docs Excel Report more

Title	Similarity	Source
None found