Multimodal Language Models See Better When They Look Shallower

Open in new window