RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
