AITopics | promptarmor

Collaborating Authors

promptarmor

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

PromptArmor: Simple yet Effective Prompt Injection Defenses

Shi, Tianneng, Zhu, Kaijie, Wang, Zhun, Jia, Yuqi, Cai, Will, Liang, Weida, Wang, Haonan, Alzahrani, Hend, Lu, Joshua, Kawaguchi, Kenji, Alomair, Basel, Zhao, Xuandong, Wang, William Yang, Gong, Neil, Guo, Wenbo, Song, Dawn

arXiv.org Artificial IntelligenceJul-22-2025

Despite their potential, recent research has demonstrated that LLM agents are vulnerable to prompt injection attacks, where malicious prompts are injected into the agent's input, causing it to perform an attacker-specified task rather than the intended task provided by the user. In this paper, we present PromptArmor, a simple yet effective defense against prompt injection attacks. Specifically, PromptArmor prompts an off-the-shelf LLM to detect and remove potential injected prompts from the input before the agent processes it. Our results show that PromptArmor can accurately identify and remove injected prompts. For example, using GPT-4o, GPT-4.1, or o4-mini, PromptArmor achieves both a false positive rate and a false negative rate below 1% on the AgentDojo benchmark. Moreover, after removing injected prompts with PromptArmor, the attack success rate drops to below 1%. We also demonstrate PromptArmor's effectiveness against adaptive attacks and explore different strategies for prompting an LLM. We recommend that PromptArmor be adopted as a standard baseline for evaluating new defenses against prompt injection attacks.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2507.15219

Country: Asia > Thailand (0.14)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Fears workplace affairs could be exposed as Slack flaw gives hackers access to private channels

Daily Mail - Science & techAug-22-2024, 19:50:14 GMT

Hackers have developed a'difficult to trace' new method to exploit AI tools inside workplace messaging app Slack -- tricking its chatbot into sending malware. The popular collaboration platform has gained prominence for facilitating quick communications between coworkers, with some linking it to a new age of'micro-cheating' and office affairs. The cybersecurity team within Slack's research program said Tuesday that they had patched the issue on the same day outside experts first reported the flaw to them. But the vulnerability, which lets hackers disguise malicious code inside uploaded documents and Google Drive files, highlights the growing risks posed by'artificial intelligence' that lacks the'street smarts' to deal with unscrupulous user requests. While the independent security researcher who first discovered the new flaw praised Slack for its diligent response, they went public with news of the AI's vulnerability'so that users could turn off the necessary settings to decrease their exposure.'

promptarmor, slack, vulnerability, (15 more...)

Daily Mail - Science & tech

Country: North America > United States > California (0.06)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.73)

Add feedback