Technical Perspective: The Importance of WINOGRANDE
Excelling at a test often does not translate into excelling at the skills the test purports to measure. This is true not only of humans but also of AI systems, and the more so the greater the claims of the test's significance. This became evident less than a decade after the introduction of the Winograd Schema Challenge (WSC),3 a test designed to measure an AI system's commonsense reasoning (CSR) ability by answering simple questions. An example would be, given the information: The sculpture rolled off the shelf because it wasn't anchored, answering: What wasn't anchored? There are multiple AI systems2 that achieve human performance on the WSC but are not capable of performing CSR.
Aug-24-2021, 22:51:00 GMT
- AI-Alerts:
- 2021 > 2021-08 > AAAI AI-Alert for Aug 31, 2021 (1.00)
- Country:
- North America > United States (0.15)
- Technology: