The unknotting number, hard unknot diagrams, and reinforcement learning