XIFBench: Evaluating Large Language Models on Multilingual Instruction Following
