VAL: Evaluate Large Language Model as Critic
Tian Lan
Neural Information Processing Systems
Critique ability, i.e., the capability of Large Language Models (LLMs) to identify and rectify flaws in responses, is crucial for their application in self-improvement and scalable oversight. Although numerous studies have evaluated the critique ability of LLMs, their comprehensiveness and reliability remain limited.
May-30-2025, 07:47:54 GMT
- Country:
- Asia > China (0.45)
- North America > United States
- Minnesota > Hennepin County > Minneapolis (0.14)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Technology: