Reviews: Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle
–Neural Information Processing Systems
I think that reporting it in the introduction is premature; I suggest describing the meaning of the theorem without giving the statement.
Jan-23-2025, 03:31:24 GMT