More than a Moment: Towards Coherent Sequences of Audio Descriptions