Review for NeurIPS paper: Softmax Deep Double Deterministic Policy Gradients
–Neural Information Processing Systems
The reviewers appreciate the simple idea brought up in the paper and the experiments designed to understand its effect and the theoretical justification. Some reviewers did express concerns regarding the significance of the theoretical results and the concerns remain after the rebuttal. Please try to incorporate these feedback in your final draft.
Neural Information Processing Systems
Jan-26-2025, 09:14:46 GMT
- Technology: