Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs

Open in new window