ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian

Open in new window