e-COP: Episodic Constrained Optimization of Policies

Neural Information Processing Systems 

Through extensive empirical analysis using benchmarks in the Safety Gym suite, we show that our algorithm has similar or better performance than SoTA (non-episodic) algorithms adapted for the episodic setting.
