A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look
