VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning

Neural Information Processing Systems 

This arrangement is particularly important in scenarios with high sampling costs and potential dangers, such as autonomous driving and robotics.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found