AMPS: ASR with Multimodal Paraphrase Supervision