Overleaf Example
–Neural Information Processing Systems
Vision-Language Models (VLMs) acquire real-world knowledge and general reasoning ability through Internet-scale image-text corpora. They can augment robotic systems with scene understanding and task planning, and assist visuomotor policies that are trained on robot trajectory data. We explore the reverse paradigm -- using rich, real, multi-modal robot trajectory data to enhance and evaluate VLMs.
Neural Information Processing Systems
Jun-15-2026, 12:55:49 GMT
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.93)
- Research Report
- Industry:
- Health & Medicine (0.67)
- Technology:
- Information Technology > Artificial Intelligence
- Robots (1.00)
- Cognitive Science > Problem Solving (0.68)
- Natural Language
- Large Language Model (1.00)
- Chatbot (0.94)
- Machine Learning > Neural Networks
- Deep Learning (1.00)
- Information Technology > Artificial Intelligence