TAP-VL: Text Layout-Aware Pre-training for Enriched Vision-Language Models