Visual Instruction Tuning

Neural Information Processing Systems 

Instruction tuning large language models (LLMs) using machine-generated instruction-following data has been shown to improve zero-shot capabilities on new tasks, but the idea is less explored in the multimodal field.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found