Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation Adam Fisch, Joshua Maynez, R. Alex Hofer Bhuwan Dhingra Amir Globerson William W. Cohen

Neural Information Processing Systems 

Evaluating machine learning models requires evaluation data.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found