Eluder dimension: localise it!
–Neural Information Processing Systems
We establish a lower bound on the eluder dimension of generalised linear model classes, showing that standard eluder dimension-based analysis cannot lead to first-order regret bounds. To address this, we introduce a localisation method for the eluder dimension; our analysis immediately recovers and improves on classic results for Bernoulli bandits, and allows for the first genuine first-order bounds for finite-horizon reinforcement learning tasks with bounded cumulative returns.
Neural Information Processing Systems
Jun-23-2026, 07:45:54 GMT
- Country:
- North America > Canada (0.28)
- Europe > United Kingdom
- England (0.28)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Information Technology (0.46)
- Technology: