Benchmark datasets driving artificial intelligence development fail to capture the needs of medical professionals