Goto

Collaborating Authors

 artificialintelligence




Lifted Weighted Mini-Bucket

Nicholas Gallo, Alexander T. Ihler

Neural Information Processing Systems

Many applications require computing likelihoods and marginal probabilities over a distribution defined by a graphical model, tasks which are intractable in general [24].





AdaptiveOnlinePacking-guidedSearchforPOMDPs

Neural Information Processing Systems

Thepartially observableMarkovdecision process (POMDP) provides ageneral framework for modeling an agent's decision process with state uncertainty, and online planning plays a pivotal role in solving it. A belief is a distribution of states representing state uncertainty. Methods forlarge-scale POMDP problems rely on the same idea of sampling both states and observations.



NonstochasticMultiarmedBandits withUnrestrictedDelays

Neural Information Processing Systems

Wefirstprovethat"delayed"Exp3achievesthe O p (KT +D)lnK regret bound conjectured by Cesa-Bianchi et al. [2019] in the case of variable, but bounded delays. Here,K is the number of actions andD isthe total delay overT rounds.