On Kernelized Multi-Armed Bandits with Constraints

Dec-23-2025, 16:37:47 GMT–Neural Information Processing Systems

We study a stochastic bandit problem with a general unknown reward function and a general unknown constraint function. Both functions can be non-linear (even non-convex) and are assumed to lie in a reproducing kernel Hilbert space (RKHS) with a bounded norm.

artificial intelligence, big data, data mining, (7 more...)

Neural Information Processing Systems

Dec-23-2025, 16:37:47 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Data Science > Data Mining
    - Big Data (0.42)
  - Artificial Intelligence > Representation & Reasoning
    - Constraint-Based Reasoning (0.38)