Improving a Neural Semantic Parser by Counterfactual Learning from Human Bandit Feedback

Open in new window