Self-supervised language learning from raw audio: Lessons from the Zero Resource Speech Challenge