SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models

Open in new window