Masked Generative Story Transformer with Character Guidance and Caption Augmentation