Structural Causal Bandits: Where to Intervene?
Sanghack Lee, Elias Bareinboim
–Neural Information Processing Systems
We study the problem of identifying the best action in a sequential decisionmaking setting when the reward distributions of the arms exhibit a non-trivial dependence structure, which is governed by the underlying causal model of the domain where the agent is deployed.
Neural Information Processing Systems
May-26-2025, 13:22:36 GMT