FastVLM: Efficient Vision Encoding for Vision Language Models

Open in new window