Voila-A: Aligning Vision-Language Models with User's Gaze Attention, Lei Ji
–Neural Information Processing Systems
In recent years, the integration of vision and language understanding has led to significant advancements in artificial intelligence, particularly through Vision-Language Models (VLMs).
Neural Information Processing Systems
May-28-2025, 07:18:57 GMT
- Genre:
- Research Report
- Experimental Study (0.93)
- Promising Solution (0.67)
- Research Report
- Industry:
- Health & Medicine (1.00)
- Information Technology > Security & Privacy (0.68)
- Technology: