The Two Word Test: A Semantic Benchmark for Large Language Models
