We will first discuss general points raised by multiple reviewers, then address reviewer-specific comments

Neural Information Processing Systems 

We thank the reviewers for their detailed comments and helpful suggestions. We will first discuss general points raised by multiple reviewers, then address reviewer-specific comments. In the paper, we branded them as "variants" of REINFORCE, intending to make it easier We will clarify this distinction in our revision. Kastner et al. (2019)); while Li & Daw (2011), provide support for the view that humans may use policy-gradient Apologies, this phrase was a typo and will be removed.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found