SciCode: A Research Coding Benchmark Curated by Scientists

Neural Information Processing Systems 

Because LMs now surpass the performance of most humans except domain experts, evaluating them becomes increasingly challenging.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found