FEABench: Evaluating Language Models on Multiphysics Reasoning Ability

Open in new window