Respond Beyond Language: A Benchmark for Video Generation in Response to Realistic User Intents
