RUST: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models
–Neural Information Processing Systems
To perform systematic evaluations, we set up 32 various tasks, including improvements to existing multimodal tasks, extension of text-only tasks to multimodal scenarios, and novel methods for risk assessment, which focus on models' basic performance with practical significance.
Neural Information Processing Systems
Nov-20-2025, 12:14:03 GMT
- Country:
- Asia > China
- Europe > Latvia
- Lubāna Municipality > Lubāna (0.04)
- North America > United States (0.04)
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (0.46)
- New Finding (0.67)
- Promising Solution (0.65)
- Industry:
- Health & Medicine (1.00)
- Information Technology > Security & Privacy (1.00)
- Law (1.00)
- Technology: