Memory Injection Attacks on LLM Agents via Query-Only Interaction
Neural Information Processing Systems
Agents powered by large language models (LLMs) have demonstrated strong capabilities in a wide range of complex, real-world applications. However, LLM agents with a compromised memory bank may easily produce harmful outputs when the past records retrieved for demonstration are malicious. In this paper, we propose a novel Memory INJection Attack, MINJA, without assuming that the attacker can directly modify the memory bank of the agent.
Jun-11-2026, 22:25:32 GMT