Retrieval-based Knowledge Augmented Vision Language Pre-training