f0eb6568ea114ba6e293f903c34d7488-AuthorFeedback.pdf

Neural Information Processing Systems 

Rephrase any claims that seem too strong, add additional reference and discuss more connections to previous works. Paper too long We will reorganize our paper (see general response). The red lines in bars represent median rewards. We improve reward under attacks consistently across runs. We vary ɛ bounds in Figure 1.