Toward an Evaluation Science for Generative AI Systems
