Design of Experiments for Stochastic Contextual Linear Bandits
Neural Information Processing Systems
In the stochastic linear contextual bandit setting, there exist several minimax procedures for exploration with policies that are reactive to the data being acquired.
Feb-10-2026, 23:38:06 GMT