ProxAnn: Use-Oriented Evaluations of Topic Models and Document Clustering
Hoyle, Alexander, Calvo-Bartolomé, Lorena, Boyd-Graber, Jordan, Resnik, Philip
–arXiv.org Artificial Intelligence
Topic model and document-clustering evaluations either use automated metrics that align poorly with human preferences or require expert labels that are intractable to scale. We design a scalable human evaluation protocol and a corresponding automated approximation that reflect practitioners' real-world usage of models. Annotators -- or an LLM-based proxy -- review text items assigned to a topic or cluster, infer a category for the group, then apply that category to other documents. Using this protocol, we collect extensive crowdworker annotations of outputs from a diverse set of topic models on two datasets. We then use these annotations to validate automated proxies, finding that the best LLM proxies are statistically indistinguishable from a human annotator and can therefore serve as a reasonable substitute in automated evaluations. Package, web interface, and data are at https://github.com/ahoho/proxann
arXiv.org Artificial Intelligence
Jul-2-2025
- Country:
- Asia
- China > Hong Kong (0.04)
- Middle East
- Jordan (0.05)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Ireland (0.04)
- United Kingdom (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Netherlands > South Holland
- Dordrecht (0.04)
- France > Hauts-de-France
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Germany > Berlin (0.04)
- Switzerland > Zürich
- Zürich (0.04)
- Spain > Galicia
- Madrid (0.04)
- North America
- Canada > British Columbia (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- Virginia (0.04)
- Delaware > Kent County (0.04)
- Ohio (0.04)
- Illinois > Cook County
- Chicago (0.04)
- District of Columbia (0.04)
- Kansas (0.04)
- Maryland (0.04)
- California > San Diego County
- San Diego (0.04)
- South Carolina (0.04)
- Washington > King County
- Pacific Ocean (0.04)
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Banking & Finance (0.67)
- Education > Educational Setting (0.92)
- Government
- Health & Medicine (1.00)
- Law > Statutes (0.68)
- Leisure & Entertainment > Sports
- Baseball (1.00)
- Technology: