Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation
