Efficient multi-prompt evaluation of LLMs Felipe Maia Polo
–Neural Information Processing Systems
Most popular benchmarks for comparing LLMs rely on a limited set of prompt templates, which may not fully capture the LLMs' abilities and can affect the
Neural Information Processing Systems
Oct-9-2025, 21:30:57 GMT
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > Denmark
- Capital Region > Copenhagen (0.04)
- North America > United States
- Michigan (0.04)
- South America > Brazil
- Minas Gerais (0.04)
- Asia > Middle East
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (0.93)
- Research Report
- Industry:
- Education > Assessment & Standards (0.46)
- Information Technology (0.67)
- Technology: