FastVLM: Self-Speculative Decoding for Fast Vision-Language Model Inference

Open in new window