Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Neural Information Processing Systems
While the primary concern in this domain is usually to find realizable representations (i.e., those that allow predicting the reward function at any context-action
Oct-2-2025, 07:51:53 GMT
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America > United States
- California > Orange County > Irvine (0.14)
- Genre:
- Research Report (0.46)
- Technology: