Kernelized Reinforcement Learning with Order Optimal Regret Bounds
–Neural Information Processing Systems
Our results show a significant polynomial in the number of episodes improvement over the state of the art.
Neural Information Processing Systems
Oct-8-2025, 02:51:01 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe
- Netherlands
- North Holland > Amsterdam (0.04)
- South Holland > Delft (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- Netherlands
- North America
- Canada > Alberta (0.14)
- United States > Massachusetts
- Middlesex County > Cambridge (0.04)
- Asia > Middle East
- Genre:
- Research Report > New Finding (0.54)
- Technology: