Towards Leveraging Sequential Structure in Animal Vocalizations