Octopus: A Multi-modal LLM with Parallel Recognition and Sequential Understanding
–Neural Information Processing Systems
A mainstream of Multi-modal Large Language Models (MLLMs) have two essential functions, i.e., visual recognition ( e.g., grounding) and understanding ( e.g.,
Neural Information Processing Systems
Oct-10-2025, 12:01:43 GMT
- Country:
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- Genre:
- Research Report > Experimental Study (0.93)
- Technology: