VideoWebArena: Evaluating Long Context Multimodal Agents with Video Understanding Web Tasks

Open in new window