WONDERBREAD: A Benchmark for Evaluating Multimodal Foundation Models on Business Process Management Tasks
Neural Information Processing Systems
Existing ML benchmarks lack the depth and diversity of annotations needed to evaluate models on business process management (BPM) tasks. BPM is the practice of documenting, measuring, improving, and automating enterprise workflows. However, research has focused almost exclusively on one task: full end-to-end automation using agents based on multimodal foundation models (FMs) like GPT-4. This focus on automation ignores the reality of how most BPM tools are applied today: simply documenting the relevant workflow takes 60% of the time of a typical process optimization project.
Mar-27-2025, 09:11:15 GMT
- Country:
  - Asia (0.27)
  - North America > United States (0.28)
- Genre:
  - Research Report > New Finding (0.67)
  - Workflow (1.00)
- Industry:
  - Health & Medicine > Health Care Providers & Services (0.46)
  - Information Technology > Software (0.45)
- Technology: