Provably Efficient Reinforcement Learning with Linear Function Approximation under Adaptivity Constraints
–Neural Information Processing Systems
We study reinforcement learning (RL) with linear function approximation under the adaptivity constraint. We consider two popular limited adaptivity models: the batch learning model and the rare policy switch model, and propose two efficient online RL algorithms for episodic linear Markov decision processes, where the transition probability and the reward function can be represented as a linear function of some known feature mapping.
Neural Information Processing Systems
May-29-2025, 04:23:40 GMT
- Country:
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- Genre:
- Research Report > New Finding (0.68)
- Industry:
- Education (0.46)
- Health & Medicine (0.33)