Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding