Distributed Soft Actor-Critic with Multivariate Reward Representation and Knowledge Distillation

Open in new window