Generalizing soft actor-critic algorithms to discrete action spaces