Protecting Your LLMs with Information Bottleneck

Neural Information Processing Systems 

The advent of large language models (LLMs) has revolutionized the field of natural language processing, yet LLMs can be attacked to produce harmful content. Despite efforts to ethically align LLMs, these alignments are often fragile and can be circumvented by jailbreaking attacks through optimized or manual adversarial prompts.
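
For orientation, the classical information bottleneck objective (Tishby et al.) that the title invokes trades off compressing an input X into a representation Z against preserving information about a target Y. How this paper instantiates X, Z, and Y for defending against adversarial prompts is not described in this excerpt; the form below is only the generic objective:

\[
\max_{p(z \mid x)} \; I(Z; Y) \;-\; \beta \, I(Z; X)
\]

Here \(I(\cdot\,;\cdot)\) denotes mutual information and \(\beta > 0\) controls the trade-off between compression and predictive power.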
