Deconstructing Self-Bias in LLM-generated Translation Benchmarks

Open in new window