Evaluating Large Language Models on Controlled Generation Tasks
