ZNO-Eval: Benchmarking reasoning capabilities of large language models in Ukrainian