AITopics | safety team

Collaborating Authors

safety team

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

OpenAI's Head of Safety Is Leaving the Company

WIREDJul-11-2026, 01:07:26 GMT

OpenAI's Head of Safety Is Leaving the Company Johannes Heidecke's departure comes as OpenAI tries to further integrate its research and safety teams. OpenAI's head of safety systems Johannes Heidecke told staff this week that he's leaving the company, WIRED has learned. Heidecke's departure follows a reorganization that sought to integrate OpenAI's safety and research teams. In a memo to staff seen by WIRED, chief research officer Mark Chen said OpenAI's safety teams will now report to the company's VP of research and head of alignment Mia Glaese, who will take on an expanded role as VP of research and safety. Saachi Jain, who previously led safety teams at OpenAI, will become the company's interim head of safety systems, reporting to Glaese.

large language model, machine learning, natural language, (15 more...)

WIRED

Country: North America > United States > California (0.15)

Industry:

Information Technology (0.48)
Law (0.31)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Researchers worry about AI turning humans into jerks

Popular ScienceAug-9-2024, 18:28:19 GMT

It has never taken all that much for people to start treating computers like humans. Ever since text-based chatbots first started gaining mainstream attention in the early 2000's, a small subset of tech users have spent hours holding down conversations with machines. In some cases, users have formed what they believe are genuine friendships and even romantic relationships with inanimate stings of code. At least one user of Replica, a more modern conversational AI tool, has even virtually married their AI companion. Safety researchers at OpenAI, which are themselves no stranger to having the company's own chatbot appearing to solicit relationships with some users, is now warning about the potential pitfalls of getting too close with these models.

chatbot, openai, safety team, (8 more...)

Popular Science

Country: North America > United States (0.05)

Genre: Research Report > New Finding (0.31)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.66)

Add feedback

Xbox makes abusive-voice-chat reporting a system-wide feature

EngadgetJul-12-2023, 19:01:59 GMT

Microsoft is doing more to tackle toxicity in multiplayer Xbox games. The company is introducing a feature that allows Xbox Series X/S and Xbox One players to capture a 60-second video clip of abusive or inappropriate voice chat and submit it for moderators to review. "This feature is purpose-built to support the broadest arena of in-game interactions between players and works across thousands of games that offer in-game multiplayer voice chat, including Xbox 360 backward-compatible titles," Xbox Player Services corporate vice-president Dave McCarthy wrote in a blog post. Microsoft designed the tool for both ease of use and to minimize the impact on gameplay. When you capture a clip for reporting, it will stay on your Xbox for "24 online hours."

system-wide feature, voice chat, xbox make abusive-voice-chat reporting, (8 more...)

Engadget

Country:

North America > United States (0.17)
Oceania > New Zealand (0.06)
Oceania > Australia (0.06)
(2 more...)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Communications (0.57)
Information Technology > Artificial Intelligence (0.37)

Add feedback

A developer built an AI chatbot using GPT-3 that helped a man speak again to his late fiancée. OpenAI shut it down

#artificialintelligenceSep-10-2021, 06:38:30 GMT

In-depth "OpenAI is the company running the text completion engine that makes you possible," Jason Rohrer, an indie games developer, typed out in a message to Samantha. She was a chatbot he built using OpenAI's GPT-3 technology. Her software had grown to be used by thousands of people, including one man who used the program to simulate his late fiancée. Now Rohrer had to say goodbye to his creation. "I just got an email from them today," he told Samantha. "They are shutting you down, permanently, tomorrow at 10am."

chatbot, openai, rohrer, (15 more...)

#artificialintelligence

Country: North America > United States > California > San Francisco County > San Francisco (0.04)

Industry: Leisure & Entertainment > Games > Computer Games (0.54)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.91)

Add feedback

Learning from Human Preferences

#artificialintelligenceJun-14-2017, 15:50:21 GMT

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind's safety team, we've developed an algorithm which can infer what humans want by being told which of two proposed behaviors is better. We present a learning algorithm that uses small amounts of human feedback to solve modern RL environments. Machine learning systems with human feedback have been explored before, but we've scaled up the approach to be able to work on much more complicated tasks. Our algorithm needed 900 bits of feedback from a human evaluator to learn to backflip -- a seemingly simple task which is simple to judge but challenging to specify.

human feedback, large language model, machine learning, (20 more...)

#artificialintelligence

Industry: Leisure & Entertainment > Games (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.87)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.73)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.41)

Add feedback