Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Feb-16-2026, 04:31:26 GMT–Neural Information Processing Systems

Current methods for identifying adversarial prompts aimed at "attacking" LLMs and eliciting undesirable outputs are limited by several factors.

large language model, machine learning, natural language

Country:
- Pacific Ocean (0.04)
- Asia > India (0.04)
- North America
  - United States > Oregon (0.04)
  - Canada > Quebec (0.04)
- Europe
  - Norway (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
  - Spain > Catalonia
    - Barcelona Province > Barcelona (0.04)
  - Slovenia > Drava
    - Municipality of Benedikt > Benedikt (0.04)
  - Latvia > Lubāna Municipality
    - Lubāna (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)

Genre:
- Research Report
  - New Finding (1.00)
  - Experimental Study (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)
- Government > Military (1.00)
- Law (0.67)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language
      - Large Language Model (1.00)
      - Chatbot (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)
