CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Open in new window