Off-policy Learning with Options and Recognizers

Open in new window