CARE: Decoding-Time Safety Alignment via Rollback and Introspection Intervention

Jun-12-2026, 01:22:34 GMT–Neural Information Processing Systems

As large language models (LLMs) are increasingly deployed in real-world applications, ensuring the safety of their outputs during decoding has become a critical challenge.

artificial intelligence, large language model, natural language, (10 more...)

Neural Information Processing Systems

Jun-12-2026, 01:22:34 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.60)