The Art of Saying No: Contextual Noncompliance in Language Models

Mar-20-2026, 18:41:23 GMT–Neural Information Processing Systems

Chat-based language models are designed to be helpful, yet they should not comply with every user request. While most existing work primarily focuses on refusal of ``unsafe'' queries, we posit that the scope of noncompliance should be broadened. We introduce a comprehensive taxonomy of contextual noncompliance describing when and how models should comply with user requests.

artificial intelligence, machine learning, natural language, (8 more...)

Neural Information Processing Systems

Mar-20-2026, 18:41:23 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (0.48)
  - Machine Learning (0.36)