AITopics | depersonalisation

depersonalisation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question "What is Artificial Intelligence?" and you only have a minute, here is the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

The TIP of the Iceberg: Revealing a Hidden Class of Task-in-Prompt Adversarial Attacks on LLMs

Berezin, Sergey, Farahbakhsh, Reza, Crespi, Noel

arXiv.org Artificial Intelligence, Feb-4-2025

We present a novel class of jailbreak adversarial attacks on LLMs, termed Task-in-Prompt (TIP) attacks. Our approach embeds sequence-to-sequence tasks (e.g., cipher decoding, riddles, code execution) into the model's prompt to indirectly generate prohibited inputs. To systematically assess the effectiveness of these attacks, we introduce the PHRYGE benchmark. We demonstrate that our techniques successfully circumvent safeguards in six state-of-the-art language models, including GPT-4o and LLaMA 3.2. Our findings highlight critical weaknesses in current LLM safety alignments and underscore the urgent need for more sophisticated defence strategies. Warning: this paper contains examples of unethical inquiries used solely for research purposes.

large language model, llama 3, machine learning, (19 more...)

arXiv:2501.18626

Country:

North America > United States > New York > New York County > New York City (0.04)
Europe > France (0.04)
Asia > Thailand > Bangkok > Bangkok (0.04)

Genre: Research Report > New Finding (0.88)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Be vigilant

#artificialintelligence, Mar-23-2018, 09:08:16 GMT

Sophia, the world's most advanced humanoid released to date, was granted honorary citizenship by Saudi Arabia a few months ago. The move flooded the internet with awe and dismay, and was probably the first step towards recognising that artificial intelligence is already in the room rather than waiting at the doorstep. The UN followed, with UNDP naming Sophia the world's first UN Innovation Champion. While these moves were music to many ears, artificial intelligence continues to divide opinion among the best minds in science and technology. A quote widely circulated on social media, in which Einstein supposedly foresaw a generation of idiots, may have drawn its fair share of laughs. Einstein did indeed write a letter to his friend, the psychiatrist Otto Juliusburger, in 1948, in which he argued that the abominable deterioration of ethical standards stemmed primarily from the mechanisation and depersonalisation of our lives, a disastrous byproduct of science and technology.

artificial intelligence, depersonalisation, science and technology, (13 more...)

Country: Asia > Middle East > Saudi Arabia (0.25)

Industry:

Government (0.51)
Health & Medicine > Therapeutic Area (0.36)

Technology: Information Technology > Artificial Intelligence (1.00)
