HYPE: A Benchmark for Human eYe Perceptual Evaluation of Generative Models
Sharon Zhou, Mitchell Gordon, Ranjay Krishna, Austin Narcomey, Li F. Fei-Fei, Michael Bernstein
–Neural Information Processing Systems
Generative models often use human evaluations to measure the perceived quality of their outputs. Automated metrics are noisy indirect proxies, because they rely on heuristics or pretrained embeddings. However, up until now, direct human evaluation strategies have been ad-hoc, neither standardized nor validated. Our work establishes a gold standard human benchmark for generative realism.
Neural Information Processing Systems
Mar-23-2025, 20:53:42 GMT
- Country:
- North America > Canada (0.14)
- Genre:
- Research Report
- Experimental Study (0.48)
- New Finding (0.47)
- Research Report
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning (1.00)
- Natural Language > Generation (0.86)
- Vision (1.00)
- Information Technology > Artificial Intelligence