Memory Injection Attacks on LLM Agents via Query-Only Interaction

Jun-11-2026, 22:25:32 GMT – Neural Information Processing Systems

Agents powered by large language models (LLMs) have demonstrated strong capabilities in a wide range of complex, real-world applications. However, LLM agents with a compromised memory bank may easily produce harmful outputs when the past records retrieved for demonstration are malicious. In this paper, we propose a novel Memory INJection Attack, MINJA, without assuming that the attacker can directly modify the memory bank of the agent.

artificial intelligence, large language model, natural language

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.86)