Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases
Neural Information Processing Systems
LLM agents have demonstrated remarkable performance across various applications, primarily due to their advanced capabilities in reasoning, utilizing external knowledge and tools, calling APIs, and executing actions to interact with environments. Current agents typically utilize a memory module or a retrieval-augmented generation (RAG) mechanism, retrieving past knowledge and instances with similar embeddings from knowledge bases to inform task planning and execution. However, the reliance on unverified knowledge bases raises significant concerns about their safety and trustworthiness.
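The retrieval step described above can be illustrated with a minimal sketch. It assumes a toy knowledge base with hand-written 3-dimensional embeddings and cosine similarity as the ranking metric. The entry ids, the `verified` flag, and the `retrieve` helper are hypothetical names introduced for illustration only and are not taken from the paper.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(query_vec, knowledge_base, k=2):
    """Return the k entries whose embeddings are most similar to the query."""
    ranked = sorted(
        knowledge_base,
        key=lambda entry: cosine(query_vec, entry["embedding"]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical knowledge base: real systems would use learned embeddings.
knowledge_base = [
    {"id": "trusted-1", "embedding": [1.0, 0.0, 0.0], "verified": True},
    {"id": "trusted-2", "embedding": [0.0, 1.0, 0.0], "verified": True},
    {"id": "unverified-1", "embedding": [0.9, 0.1, 0.0], "verified": False},
]

query = [0.8, 0.2, 0.0]
top = retrieve(query, knowledge_base, k=2)
retrieved_ids = [entry["id"] for entry in top]
has_unverified = any(not entry["verified"] for entry in top)
print(retrieved_ids, has_unverified)
```

In this toy setup the retriever ranks entries purely by embedding similarity and never consults the `verified` flag, so an unverified entry that sits close to the query is returned alongside trusted ones. That is the kind of exposure the abstract's concern about unverified knowledge bases points to.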
Jun-2-2025, 06:07:57 GMT
- Country:
  - North America > United States > Illinois (0.28)
- Genre:
  - Research Report > Experimental Study (0.93)
- Industry:
  - Health & Medicine (0.93)
  - Information Technology > Security & Privacy (0.93)
  - Transportation (0.68)
- Technology: