BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software

Open in new window