Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model
