ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation

Open in new window