Uncertainty
Appendix A Experiments In this section, we demonstrate our maximum likelihood estimation (8)
In the first experiment, we compare the performance of the aforementioned four alternatives. As we generated instances to satisfy the full-rank condition, i.e., Assumption E.1, or if random perturbations are applied to the underlying LMAB model [8]). (recall Figure 3). Multi-Armed Bandits problem that has been extensively studied in literature ( e.g., see [ When the time-horizon is sufficiently long but finite e.g., if Regime switching bandits LMAB may be also seen as a special type of adversarial or non-stationary bandits ( e.g., [ The standard objective in non-stationary bandits is to find the best stationary policy in hindsight with unlimited possible contexts. We focus on significantly more general cases where there is no obvious way of clustering observations, e.g., when Note that this could still be in H A regime with large number of actions A .
Confidence sequences for sampling without replacement Ian Waudby-Smith
We present a generic approach to constructing a frequentist CS using Bayesian tools, based on the fact that the ratio of a prior to the posterior at the ground truth is a martingale. We then present Hoeffding-and empirical-Bernstein-type time-uniform CSs and fixed-time confidence intervals for sampling WoR, which improve on previous bounds in the literature and explicitly quantify the benefit of WoR sampling.
paper
This work focuses on the representation learning question: how can we learn such features? Under the assumption that the underlying (unknown) dynamics correspond to a low rank transition matrix, we show how the representation learning question is related to a particular non-linear matrix decomposition problem.
Double Machine Learning Density Estimation for Local Treatment Effects with Instruments
Local treatment effects are a common quantity found throughout the empirical sciences that measure the treatment effect among those who comply with what they are assigned. Most of the literature is focused on estimating the average of such quantity, which is called the " local average treatment effect (LATE) " [
Functional Variational Inference based on Stochastic Process Generators
Bayesian inference in the space of functions has been an important topic for Bayesian modeling in the past. In this paper, we propose a new solution to this problem called Functional V ariational Inference (FVI). In FVI, we minimize a divergence in function space between the variational distribution and the posterior process.