Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning
–Neural Information Processing Systems
Large vision-language models (VLMs) fine-tuned on specialized visual instruction-following data have exhibited impressive language reasoning capabilities across various scenarios.
Neural Information Processing Systems
Oct-10-2025, 16:29:55 GMT
- Country:
- Asia
- China
- Guangdong Province > Shenzhen (0.04)
- Hong Kong (0.04)
- Middle East > Jordan (0.04)
- China
- Europe > Sweden
- North America > United States
- Pennsylvania > Allegheny County
- Pittsburgh (0.04)
- Washington > King County
- Seattle (0.04)
- Pennsylvania > Allegheny County
- Asia
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.67)
- Research Report
- Industry:
- Leisure & Entertainment > Games (1.00)
- Technology: