Diagnosing Bottlenecks in Data Visualization Understanding by Vision-Language Models