InnovatorBench: Evaluating Agents' Ability to Conduct Innovative LLM Research
