Learning to Reason for Long-Form Story Generation