Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation Adam Fisch, Joshua Maynez, R. Alex Hofer Bhuwan Dhingra Amir Globerson William W. Cohen

Open in new window