The RealHumanEval: Evaluating Large Language Models' Abilities to Support Programmers

Open in new window