From Sound to Sight: Towards AI-authored Music Videos