Benchmarking Foundation Models on Exceptional Cases: Dataset Creation and Validation

Open in new window