Goto

Collaborating Authors

 constitution


How Trump Keeps Exploiting America's Legal Loopholes

TIME - Tech

Follow this section to personalize your feed and get instant alerts. Follow Go to your personalized feed WHY FOLLOW? Smart Alerts: Get notified about major news as it happens. Follow this tag to personalize your feed and get instant alerts. Follow Go to your personalized feed WHY FOLLOW?


No, Artificial Intelligence Is Not Conscious

The Atlantic - Technology

Taken to its logical conclusion, this line of thinking is absurd--and damning. Anthropic is regarded as a giant among AI companies, but perhaps what it really excels in is anthropomorphism. Earlier this year, the company released an 84-page document titled Claude's "constitution," Claude being the name of the large language model that is the company's flagship product. The first sentence reads, "Claude's constitution is a detailed description of Anthropic's intentions for Claude's values and behaviors." It goes on: "The document is written with Claude as its primary audience," "we want Claude to be able to use its judgment once armed with a good understanding of the relevant considerations," "Claude's moral status is deeply uncertain," and "Claude may have some functional version of emotions or feelings." This anthropomorphism is by no means limited to the document. In an interview earlier this year, Anthropic's CEO, Dario Amodei, said that "we're open to the idea" that AI could be conscious. In a separate interview, Anthropic's in-house philosopher, Amanda Askell (who is credited as a lead author of Claude's constitution), said, "I want Claude to be very happy--and this is a thing that I want Claude to know more, because I worry about Claude getting anxious when people are mean to it on the internet and stuff." It's enough to make you wonder: Should we seriously consider the possibility that Claude, or any large language model, might be conscious? And if it has feelings, is it capable of receiving moral instruction?


Self-Supervised Alignment with Mutual Information: Learning to Follow Principles without Preference Labels

Neural Information Processing Systems

When prompting a language model (LM), users often expect the model to adhere to a set of behavioral principles across diverse tasks, such as producing insightful content while avoiding harmful or biased language. Instilling such principles (i.e., a constitution) into a model is resource-intensive, technically challenging, and generally requires human preference labels or examples. We introduce SAMI, an iterative algorithm that finetunes a pretrained language model (without requiring preference labels or demonstrations) to increase the conditional mutual information between constitutions and self-generated responses given queries from a dataset. On single-turn dialogue and summarization, a SAMI-trained mistral-7b outperforms the initial pretrained model, with win rates between 66% and 77%.



The Only Thing Standing Between Humanity and AI Apocalypse Is โ€ฆ Claude?

WIRED

The Only Thing Standing Between Humanity and AI Apocalypse Is Claude? As AI systems grow more powerful, Anthropic's resident philosopher says the startup is betting Claude itself can learn the wisdom needed to avoid disaster. Anthropic is locked in a paradox: Among the top AI companies, it's the most obsessed with safety and leads the pack in researching how models can go wrong. But even though the safety issues it has identified are far from resolved, Anthropic is pushing just as aggressively as its rivals toward the next, potentially more dangerous, level of artificial intelligence. Its core mission is figuring out how to resolve that contradiction. Last month, Anthropic released two documents that both acknowledged the risks associated with the path it's on and hinted at a route it could take to escape the paradox.


From 'nerdy' Gemini to 'edgy' Grok: how developers are shaping AI behaviours

The Guardian

Which chatbot we choose could become an extension and reflection of our personalities, like the clothes we wear or car we drive. Which chatbot we choose could become an extension and reflection of our personalities, like the clothes we wear or car we drive. From'nerdy' Gemini to'edgy' Grok: how developers are shaping AI behaviours Do you want an AI assistant that gushes about how it "loves humanity" or one that spews sarcasm? How about a political propagandist ready to lie? If so, ChatGPT, Grok and Qwen are at your disposal. Companies that create AI assistants, from the US to China, are increasingly wrestling with how to mould their characters, and it is no abstract debate.


How Do You Teach an AI to Be Good? Anthropic Just Published Its Answer

TIME - Tech

How Do You Teach an AI to Be Good? A person holds a smartphone displaying the logo of "Claude," an AI language model by Anthropic A person holds a smartphone displaying the logo of "Claude," an AI language model by Anthropic Cheng Xin/Getty Images Getting AI models to behave used to be a thorny mathematical problem. These days, it looks a bit more like raising a child. That, at least, is according to Amanda Askell --a trained philosopher whose unique role within Anthropic is crafting the personality of Claude, the AI firm's rival to ChatGPT. "Imagine you suddenly realize that your six-year-old child is a kind of genius," Askell says.


5 new quarters commemorate 250 years of American independence

Popular Science

The new designs honor the Constitution, Civil War, and more. Breakthroughs, discoveries, and DIY tips sent every weekday. While we've said goodbye to both the year 2025 and the penny, five new United States quarters will be finding their way into your pocket soon enough. The designs of each new quarter will honor the country's 250th anniversary (aka its semiquincentennial). According to a press release from the U.S. Mint, the coins "commemorate 250 years of American Liberty by reflecting our country's founding principles and honoring our Nation's history."


Russia-Ukraine war: List of key events, day 1,394

Al Jazeera

What is in the 28-point US plan for Ukraine? 'Ukraine is running out of men, money and time' Can the US get all sides to end the war? Why is Europe opposing Trump's peace plan? Three people, including two crew members of a cargo vessel, were killed in overnight Ukrainian drone attacks on the Russian port of Rostov-on-Don and the town of Bataysk in the country's southern Rostov region, local governor Yury Slyusar said. Russian strikes near Ukraine's Black Sea port of Odesa killed a woman in her car and hit infrastructure.


MORNING GLORY: A President Donald Trump-branded energy drink?

FOX News

This material may not be published, broadcast, rewritten, or redistributed. Quotes displayed in real-time or delayed by at least 15 minutes. Market data provided by Factset . Powered and implemented by FactSet Digital Solutions . Mutual Fund and ETF data provided by Refinitiv Lipper .