preliminary experiments with VIMCO which does not seem to outperform moving average baseline on the bit-vector

Neural Information Processing Systems 

We thank all reviewers for their comments. In 5.1, we compare against Sum&Sample, which was shown in prior work [Figure 1 of Liu et al., 2019] Upon acceptance we will normalize these baselines for all experiments and include suggested ones. Thanks for bringing this up, scalability is an important point that we want to make sure is clear in the final version. We will make this clearer. We will include this analysis and plots as suggested.