Deliberation Networks and How to Train Them