Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning

Neural Information Processing Systems 

Additionally, we offer a practical version of WSAC and compare it with existing state-of-the-art safe offline RL algorithms in several continuous control environments.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found