Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Neural Information Processing Systems
Autonomous agents that accomplish complex computer tasks with minimal human intervention can significantly enhance the accessibility and productivity of human-computer interaction. Existing benchmarks either lack interactive environments or are limited to specific applications or domains, failing to reflect the diversity and complexity of real-world computer use and limiting agent scalability.
Mar-21-2025, 07:58:23 GMT
- Country:
- Asia (0.28)
- Genre:
- Instructional Material > Course Syllabus & Notes (0.46)
- Workflow (0.95)
- Industry:
- Education
- Educational Setting > Online (0.93)
- Educational Technology (0.67)
- Information Technology > Software (0.69)
- Law (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Natural Language
- Chatbot (0.93)
- Large Language Model (1.00)
- Representation & Reasoning > Agents (1.00)
- Vision (0.92)
- Communications
- Mobile (1.00)
- Social Media (1.00)
- Hardware (0.93)
- Human Computer Interaction (0.93)
- Software (1.00)