ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models
Neural Information Processing Systems
Multi-turn visual conversation is an important ability of real-world AI assistants. However, a corresponding evaluation benchmark is missing.
Oct-10-2025, 14:13:08 GMT
- Country:
- Asia > China
- Indian Ocean > Arabian Sea (0.04)
- Genre:
- Research Report (0.92)
- Industry:
- Information Technology > Security & Privacy (0.45)
- Law (1.00)
- Leisure & Entertainment > Games (0.68)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science (1.00)
- Machine Learning > Neural Networks
- Deep Learning (0.72)
- Natural Language
- Chatbot (0.72)
- Large Language Model (1.00)
- Representation & Reasoning (1.00)
- Vision (1.00)