Why we must rethink AI benchmarks
This article is part of our reviews of AI research papers, a series of posts that explore the latest findings in artificial intelligence. For decades, researchers have used benchmarks to measure progress in different areas of artificial intelligence such as vision and language. Especially in the past few years, with deep learning becoming very popular, benchmarks have become a narrow focus for many research labs and scientists. But while benchmarks can help compare the performance of AI systems on specific problems, they are often taken out of context, sometimes to harmful results. In a paper accepted at the NeurIPS 2021 conference, scientists at University of California, Berkeley, University of Washington, and Google outline the limits of popular AI benchmarks.
May-23-2022, 01:11:45 GMT
- Country:
- North America > United States > California > Alameda County > Berkeley (0.25)
- Genre:
- Research Report (0.95)
- Technology: