NeurIPS_2024_Touchstone1_0-3
–Neural Information Processing Systems
How can we test AI performance? This question seems trivial, but it isn't. Standard benchmarks often have problems such as in-distribution and small-size test sets, oversimplified metrics, unfair comparisons, and short-term outcome pressure. As a consequence, good performance on standard benchmarks does not guarantee success in real-world scenarios. To address these problems, we present Touchstone, a large-scale collaborative segmentation benchmark of 9 types of abdominal organs.
Neural Information Processing Systems
May-28-2025, 15:57:12 GMT
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Health & Medicine
- Diagnostic Medicine > Imaging (1.00)
- Nuclear Medicine (1.00)
- Therapeutic Area > Oncology (1.00)
- Health & Medicine
- Technology: