CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
