Pre-Trained CNN Architecture for Transformer-Based Image Caption Generation Model
