Evaluating Large Language Models with Tests of Spanish as a Foreign Language: Pass or Fail?

Open in new window