REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once
