Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding

Open in new window