Efficient Batched Algorithm for Contextual Linear Bandits with Large Action Space via Soft Elimination
Neural Information Processing Systems
Unlike existing batched algorithms, which rely on action elimination and are therefore not implementable for large action sets, our algorithm uses only a linear optimization oracle over the action set to design the policy.
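To make the phrase "linear optimization oracle" concrete, here is a minimal sketch, not the paper's algorithm. It assumes a hypothetical action set, the hypercube {-1, +1}^d, and the function names are illustrative. For this set, maximizing the inner product of an action with a parameter vector `theta` takes O(d) time, although the set contains 2^d actions. A brute-force enumeration is included only as a check for small d.

```python
import itertools

import numpy as np


def hypercube_oracle(theta):
    """Return an action a in {-1, +1}^d maximizing <a, theta> in O(d) time.

    Assumption: the action set is the hypercube, so each coordinate can be
    chosen independently by matching the sign of theta (ties go to +1).
    """
    return np.where(theta >= 0, 1.0, -1.0)


def brute_force_oracle(theta):
    """Same maximization by enumerating all 2^d actions (small d only)."""
    d = len(theta)
    actions = np.array(list(itertools.product([-1.0, 1.0], repeat=d)))
    return actions[np.argmax(actions @ theta)]


theta = np.array([0.5, -1.2, 0.3, 2.0])
best = hypercube_oracle(theta)
print(best, float(best @ theta))
```

The point of the sketch is the access model: a method that only ever queries such an oracle never has to enumerate or store the action set, which is what elimination-based approaches would require.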
Oct-9-2025, 05:06:35 GMT
- Country:
  - Asia > Middle East
  - Europe > United Kingdom
    - England > Cambridgeshire > Cambridge (0.04)
  - North America > United States
    - California > Los Angeles County > Los Angeles (0.14)
- Genre:
  - Research Report (0.46)
- Industry:
  - Health & Medicine (0.67)
- Technology: