Context-Aware Testing: A New Paradigm for Model Testing with Large Language Models

Neural Information Processing Systems 

The predominant de facto paradigm of testing ML models relies on either using only held-out data to compute aggregate evaluation metrics or by assessing the performance on different subgroups.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found