Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts
–Neural Information Processing Systems
Current methods for identifying adversarial prompts aimed at "attacking" LLMs and eliciting undesirable outputs are limited by several factors.
Neural Information Processing Systems
Feb-16-2026, 04:31:26 GMT
- Country:
- Asia > India (0.04)
- Europe
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Latvia > Lubāna Municipality
- Lubāna (0.04)
- Norway (0.04)
- Slovenia > Drava
- Municipality of Benedikt > Benedikt (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- France > Provence-Alpes-Côte d'Azur
- North America
- Canada > Quebec (0.04)
- United States > Oregon (0.04)
- Pacific Ocean (0.04)
- Genre:
- Research Report
- Experimental Study (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Government > Military (1.00)
- Information Technology > Security & Privacy (1.00)
- Law (0.67)
- Technology: