Risk Sensitive Dead-end Identification in Safety-Critical Offline Reinforcement Learning
