ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Ablation Capability for Large Vision-Language Models

Neural Information Processing Systems 

Multi-turn visual conversation is an important ability of real-world AI assistants. However, the related evaluation benchmark is missed.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found