Benchmarks for Automated Commonsense Reasoning: A Survey
–arXiv.org Artificial Intelligence
More than one hundred benchmarks have been developed to test the commonsense knowledge and commonsense reasoning abilities of artificial intelligence (AI) systems. However, these benchmarks are often flawed and many aspects of common sense remain untested. Consequently, we do not currently have any reliable way of measuring to what extent existing AI systems have achieved these abilities. This paper surveys the development and uses of AI commonsense benchmarks. We discuss the nature of common sense; the role of common sense in AI; the goals served by constructing commonsense benchmarks; and desirable features of commonsense benchmarks. We analyze the common flaws in benchmarks, and we argue that it is worthwhile to invest the work needed ensure that benchmark examples are consistently high quality. We survey the various methods of constructing commonsense benchmarks. We enumerate 139 commonsense benchmarks that have been developed: 102 text-based, 18 image-based, 12 video based, and 7 simulated physical environments. We discuss the gaps in the existing benchmarks and aspects of commonsense reasoning that are not addressed in any existing benchmark. We conclude with a number of recommendations for future development of commonsense AI benchmarks.
arXiv.org Artificial Intelligence
Feb-22-2023
- Country:
- Africa > Middle East (0.04)
- Asia
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Vietnam > Long An Province (0.04)
- Middle East > Republic of Türkiye
- Europe
- Austria (0.04)
- Czechia > Prague (0.04)
- Hungary > Budapest
- Budapest (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Poland (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Oxfordshire > Oxford (0.04)
- Indian Ocean > Bay of Bengal (0.04)
- North America > United States
- California
- San Francisco County > San Francisco (0.04)
- Santa Clara County > Palo Alto (0.04)
- Indiana (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Michigan (0.04)
- Missouri (0.04)
- New York > New York County
- New York City (0.04)
- Utah (0.04)
- California
- Oceania > Australia
- New South Wales (0.04)
- Pacific Ocean > North Pacific Ocean
- San Francisco Bay > Golden Gate (0.04)
- Genre:
- Research Report (0.84)
- Industry:
- Education > Curriculum
- Subject-Specific Education (0.45)
- Health & Medicine (0.67)
- Leisure & Entertainment > Games (0.92)
- Materials (0.67)
- Media > News (1.00)
- Education > Curriculum
- Technology: