Inference via knowledge compilation has also been used for many applications in neuro-symbolic AI,suchasconstrained generation [2,54]andneural logic programming [34,28].
Joulani et al. (2013) have studied multi-armed bandits with delayed feedback under the assumption that the rewards are stochastic and the delays are sampled from a fixed distribution.