AITopics | safety and control

Collaborating Authors

safety and control

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Do Large Language Models Have a Planning Theory of Mind? Evidence from MindGames: a Multi-Step Persuasion Task

Moore, Jared, Cooper, Ned, Overmark, Rasmus, Cibralic, Beba, Haber, Nick, Jones, Cameron R.

arXiv.org Artificial IntelligenceJul-23-2025

Recent evidence suggests Large Language Models (LLMs) display Theory of Mind (ToM) abilities. Most ToM experiments place participants in a spectatorial role, wherein they predict and interpret other agents' behavior. However, human ToM also contributes to dynamically planning action and strategically intervening on others' mental states. We present MindGames: a novel `planning theory of mind' (PToM) task which requires agents to infer an interlocutor's beliefs and desires to persuade them to alter their behavior. Unlike previous evaluations, we explicitly evaluate use cases of ToM. We find that humans significantly outperform o1-preview (an LLM) at our PToM task (11% higher; $p=0.006$). We hypothesize this is because humans have an implicit causal model of other agents (e.g., they know, as our task requires, to ask about people's preferences). In contrast, o1-preview outperforms humans in a baseline condition which requires a similar amount of planning but minimal mental state inferences (e.g., o1-preview is better than humans at planning when already given someone's preferences). These results suggest a significant gap between human-like social reasoning and LLM abilities.

information, large language model, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2507.16196

Country: North America > United States (0.93)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.66)

Industry:

Education (1.00)
Leisure & Entertainment (0.93)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Workshop on Safety and Control for AI by White House OSTP/Carnegie Mellon Univ • /r/artificial

#artificialintelligenceMay-31-2016, 13:51:47 GMT

We here at Carnegie Mellon University wanted to let you know about a great event on artificial intelligence that we're hosting in conjunction with the White House Office of Science and Technology Policy in late June. You may have seen this recent article on these workshops featured in Wired. While we are but one of the four workshops going on in the coming months, we are the ONLY workshop in the series with a clear focus on the technical aspects of safe and controlled AI. We want to dive deep on how we can bring together machine learning, math-based systems reasoning, and software architecture to build AI systems with a high level of assurance. And we'd love for you to be a part of that conversation here in Pittsburgh.

artificial intelligence, machine learning, safety and control, (1 more...)

#artificialintelligence

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.11)

Industry: Government > Regional Government > North America Government > United States Government (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.31)

Add feedback

SafArtInt 2016

#artificialintelligenceMay-26-2016, 20:15:48 GMT

The computer science community has been exploring the role of artificial intelligence (AI) in systems for more than a half-century. In the last few years, AI development has reached a threshold of practicability, and AI capability is now emerging in sectors ranging from vehicles, logistics, and military systems to health care, financial services, and smart cities. The economic and societal impacts could be dramatic, and investment in the development of AI applications is now a world-wide phenomenon. Many technical leaders now believe that the principal limits on exploiting AI derive primarily from our confidence in the safety of these smart systems – that they will operate in a safe and controlled manner. Some AI experts have asserted that the ability to assure safety and control is more important to the future of AI even than improvements in the AI algorithms themselves.

ai system, artificial intelligence, workshop, (7 more...)

#artificialintelligence

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.42)

Industry: Government > Regional Government > North America Government > United States Government (0.34)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback