Improving User Interface Generation Models from Designer Feedback
Jason Wu, Amanda Swearngin, Arun Krishna Vajjala, Alan Leung, Jeffrey Nichols, Titus Barik
arXiv.org Artificial Intelligence
Despite being trained on vast amounts of data, most LLMs are unable to reliably generate well-designed UIs. Designer feedback is essential to improving performance on UI generation; however, we find that existing RLHF methods based on ratings or rankings are not well-aligned with designers' workflows and ignore the rich rationale used to critique and improve UI designs. In this paper, we investigate several approaches for designers to give feedback to UI generation models, using familiar interactions such as commenting, sketching and direct manipulation. We first perform a study with 21 designers where they gave feedback using these interactions, which resulted in ~1500 design annotations. We then use this data to finetune a series of LLMs to generate higher quality UIs. Finally, we evaluate these models with human judges, and we find that our designer-aligned approaches outperform models trained with traditional ranking feedback and all tested baselines, including GPT-5.
Sep-23-2025