Benchmarking Large Language Model Capabilities for Conditional Generation
