Enhancing Financial VQA in Vision Language Models using Intermediate Structured Representations