Reviews: Provably Efficient Q-Learning with Low Switching Cost
–Neural Information Processing Systems
They also present (two flavours of) a Q-learning algorithm that achieve the regret matching the previous work however with the added benefit of having lower local switching cost.
Neural Information Processing Systems
Jan-23-2025, 09:57:25 GMT
- Technology: