Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models