Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation
–Neural Information Processing Systems
In the exploration phase, the agent interacts with the environment and collects samples without the reward.
Neural Information Processing Systems
Oct-2-2025, 06:00:56 GMT
- Country:
- North America > United States > California > Los Angeles County > Los Angeles (0.29)
- Technology: