Red-teaming LLM Agents via Poisoning Memory or Knowledge Bases

Jun-2-2025, 06:07:57 GMT–Neural Information Processing Systems

LLM agents have demonstrated remarkable performance across various applications, primarily due to their advanced capabilities in reasoning, utilizing external knowledge and tools, calling APIs, and executing actions to interact with environments. Current agents typically utilize a memory module or a retrieval-augmented generation (RAG) mechanism, retrieving past knowledge and instances with similar embeddings from knowledge bases to inform task planning and execution. However, the reliance on unverified knowledge bases raises significant concerns about their safety and trustworthiness.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Jun-2-2025, 06:07:57 GMT

Conferences PDF

Add feedback

Country:
- North America > United States > Illinois (0.28)

Genre:
- Research Report > Experimental Study (0.93)

Industry:
- Health & Medicine (0.93)
- Information Technology > Security & Privacy (0.93)
- Transportation (0.68)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.67)
  - Natural Language > Large Language Model (1.00)
  - Representation & Reasoning (1.00)