Generalizing soft actor-critic algorithms to discrete action spaces

Open in new window