Structural Causal Bandits: Where to Intervene?
–Neural Information Processing Systems
We study the problem of identifying the best action in a sequential decision-making setting when the reward distributions of the arms exhibit a non-trivial dependence structure, which is governed by the underlying causal model of the domain where the agent is deployed.
Neural Information Processing Systems
Nov-20-2025, 22:56:48 GMT
- Technology: