Continuous Soft Actor-Critic: An Off-Policy Learning Method Robust to Time Discretization

Open in new window