WebGen-V Bench: Structured Representation for Enhancing Visual Design in LLM-based Web Generation and Evaluation
Wang, Kuang-Da, Wang, Zhao, Shimose, Yotaro, Wang, Wei-Yao, Takamatsu, Shingo
arXiv.org Artificial Intelligence
Motivated by recent advances in leveraging LLMs for coding and multimodal understanding, we present WebGen-V, a new benchmark and framework for instruction-to-HTML generation that enhances both data quality and evaluation granularity. WebGen-V contributes three key innovations: (1) an unbounded and extensible agentic crawling framework that continuously collects real-world webpages and can be leveraged to augment existing benchmarks; (2) a structured, section-wise data representation that integrates metadata, localized UI screenshots, and JSON-formatted text and image assets, with explicit alignment between content, layout, and visual components for detailed multimodal supervision; and (3) a section-level multimodal evaluation protocol that aligns text, layout, and visuals for high-granularity assessment. Experiments with state-of-the-art LLMs and ablation studies validate the effectiveness of our structured data and section-wise evaluation, as well as the contribution of each component. To the best of our knowledge, WebGen-V is the first work to enable high-granularity agentic crawling and evaluation for instruction-to-HTML generation, providing a unified pipeline from real-world data acquisition and webpage generation to structured multimodal assessment.
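To make the section-wise representation in item (2) more concrete, the following is a minimal sketch of what one such record could look like. The abstract does not specify a schema, so every class name, field name, path, and URL below is a hypothetical placeholder rather than the WebGen-V format.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class Section:
    # Hypothetical fields: the abstract only says each section carries
    # a localized UI screenshot plus text and image assets.
    section_id: str
    screenshot_path: str
    texts: list[str] = field(default_factory=list)
    images: list[str] = field(default_factory=list)


@dataclass
class PageRecord:
    # Hypothetical page-level container: metadata plus ordered sections.
    url: str
    metadata: dict[str, str] = field(default_factory=dict)
    sections: list[Section] = field(default_factory=list)


def to_json(page: PageRecord) -> str:
    """Serialize a page record, including nested sections, to JSON."""
    return json.dumps(asdict(page), indent=2, sort_keys=True)


def section_ids(page: PageRecord) -> list[str]:
    """Return section identifiers in page order."""
    return [s.section_id for s in page.sections]


# Illustrative record; all values are made up for the example.
example = PageRecord(
    url="https://example.com",
    metadata={"title": "Example"},
    sections=[
        Section("hero", "shots/hero.png", ["Welcome"], ["img/banner.png"]),
        Section("footer", "shots/footer.png", ["Contact us"]),
    ],
)

if __name__ == "__main__":
    print(to_json(example))
```

Keeping text, images, and the screenshot together per section is what would let a section-level evaluator compare a generated section against its reference without re-parsing the whole page.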
Oct-20-2025
- Country:
  - Asia
    - Japan
      - Honshū > Kantō
        - Tokyo Metropolis Prefecture > Tokyo (0.14)
      - Shikoku > Kagawa Prefecture
        - Takamatsu (0.04)
    - Middle East > UAE
      - Abu Dhabi Emirate > Abu Dhabi (0.14)
  - Europe > Austria
    - Vienna (0.14)
  - North America > United States (0.14)
- Genre:
- Research Report > New Finding (0.67)
- Industry:
- Media (0.67)
- Transportation (1.00)
- Banking & Finance
- Education
- Curriculum (0.67)
- Educational Setting > Online (0.93)
- Health & Medicine
- Consumer Health (1.00)
- Health Care Technology (0.68)
- Pharmaceuticals & Biotechnology (0.93)
- Therapeutic Area > Dermatology (0.67)
- Appliances & Durable Goods (0.67)
- Marketing (0.92)
- Consumer Products & Services
- Information Technology > Services (0.67)
- Technology: