Reviewer 1: We will be sure to provide a more accurate and nuanced discussion of the downsides of our auxiliary

Neural Information Processing Systems 

We thank all reviewers for their constructive and helpful comments. Reviewer 1: Regarding runtime evaluation, what we called the "wall clock time" is the sum of the GPU time and the CPU time, and the reported time to "run the neural net on its own" is the GPU time. We will revise our paper to include this discussion. We have filled in this gap in the literature for flow models. ANS for autoregressive models, which are slow for decoding.