Training an Agent to Ground Commands with Reward and Punishment
