Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following

Open in new window