QA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content João Monteiro,3, Pierre-André Noël
–Neural Information Processing Systems
Consequently, evaluating models on test splits that might have leaked into the training set is prone to misleading conclusions.
Neural Information Processing Systems
Oct-9-2025, 21:49:31 GMT
- Country:
- Asia
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada
- Dominican Republic (0.04)
- United States
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Texas > Travis County
- Austin (0.04)
- Louisiana > Orleans Parish
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- Genre:
- Research Report > New Finding (0.46)
- Industry:
- Education (1.00)
- Energy (0.67)
- Government (0.93)
- Health & Medicine > Therapeutic Area (0.68)
- Information Technology > Security & Privacy (0.68)
- Technology: