confession
The Download: LLM confessions, and tapping into geothermal hot spots
OpenAI is testing a new way to expose the complicated processes at work inside large language models. Researchers at the company can make an LLM produce what they call a confession, in which the model explains how it carried out a task and (most of the time) own up to any bad behavior. Figuring out why large language models do what they do--and in particular why they sometimes appear to lie, cheat, and deceive--is one of the hottest topics in AI right now. If this multitrillion-dollar technology is to be deployed as widely as its makers hope it will be, it must be made more trustworthy. OpenAI sees confessions as one step toward that goal. Sometimes geothermal hot spots are obvious, marked by geysers and hot springs on Earth's surface.
- Asia > China (0.06)
- North America > United States > New York (0.05)
- North America > United States > Nevada (0.05)
- (2 more...)
- Health & Medicine > Therapeutic Area (1.00)
- Energy > Renewable > Geothermal > Geothermal Resource Type (0.92)
OpenAI has trained its LLM to confess to bad behavior
Large language models often lie and cheat. We can't stop that--but we can make them own up. OpenAI is testing another new way to expose the complicated processes at work inside large language models. Researchers at the company can make an LLM produce what they call a confession, in which the model explains how it carried out a task and (most of the time) owns up to any bad behavior. Figuring out why large language models do what they do--and in particular why they sometimes appear to lie, cheat, and deceive--is one of the hottest topics in AI right now. If this multitrillion-dollar technology is to be deployed as widely as its makers hope it will be, it must be made more trustworthy.
- North America > United States > Massachusetts (0.05)
- Asia > China (0.05)
Towards Trustworthy Legal AI through LLM Agents and Formal Reasoning
Chen, Linze, Cai, Yufan, Hou, Zhe, Dong, Jinsong
The rationality of law manifests in two forms: substantive rationality, which concerns the fairness or moral desirability of outcomes, and formal rationality, which requires legal decisions to follow explicitly stated, general, and logically coherent rules. Existing LLM-based systems excel at surface-level text analysis but lack the guarantees required for principled jurisprudence. We introduce L4M, a novel framework that combines adversarial LLM agents with SMT-solver-backed proofs to unite the interpretive flexibility of natural language with the rigor of symbolic verification. The pipeline consists of three phases: (1) Statute Formalization, where domain-specific prompts convert legal provisions into logical formulae; (2) Dual Fact and Statute Extraction, in which prosecutor- and defense-aligned LLMs independently map case narratives to fact tuples and statutes, ensuring role isolation; and (3) Solver-Centric Adjudication, where an autoformalizer compiles both parties' arguments into logic constraints, and unsat cores trigger iterative self-critique until a satisfiable formula is achieved, which is then verbalized by a Judge-LLM into a transparent verdict and optimized sentence. Experimental results on public benchmarks show that our system surpasses advanced LLMs including GPT-o4-mini, DeepSeek-V3, and Claude 4 as well as state-of-the-art Legal AI baselines, while providing rigorous and explainable symbolic justifications.
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Asia > Singapore (0.04)
- Law > Criminal Law (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.94)
Confessions of a Recovering AI Porn Addict
Kyle's interest in AI porn began last summer as he circled rock bottom. From the outside, everything seemed fine. He was in a committed relationship with his longtime girlfriend. He enjoyed the perks of his job working for a sports betting company. Still, all he could think about was fueling his porn addiction in new ways--even at the cost of feeling mentally drained and tired.
Deus in machina: Swiss church installs AI-powered Jesus
The small, unadorned church has long ranked as the oldest in the Swiss city of Lucerne. But Peter's chapel has become synonymous with all that is new after it installed an artificial intelligence-powered Jesus capable of dialoguing in 100 different languages. "It was really an experiment," said Marco Schmid, a theologian with the church. "We wanted to see and understand how people react to an AI Jesus. What would they talk with him about? Would there be interest in talking to him? The installation, known as Deus in Machina, was launched in August as the latest initiative in a years-long collaboration with a local university research lab on immersive reality. After projects that had experimented with virtual and augmented reality, the church decided that the next step was to install an avatar. Schmid said: "We had a discussion about what kind of avatar it would be – a theologian, a person or a saint?
'If journalism is going up in smoke, I might as well get high off the fumes': confessions of a chatbot helper
For several hours a week, I write for a technology company worth billions of dollars. Alongside me are published novelists, rising academics and several other freelance journalists. The workload is flexible, the pay better than we are used to, and the assignments never run out. But what we write will never be read by anyone outside the company. We are writing for an AI.
- Information Technology (0.90)
- Media > News (0.70)
Confessions of an AI Clickbait Kingpin
"I'm not a fan of AI," Nebojša Vujinović Vujo says. The admission surprises me: He has built a bustling business by snapping up abandoned news outlets and other websites and stuffing them full of algorithmically generated articles. Although he accepts that his model rankles writers and readers alike, he says he's simply embracing an unstoppable new tool--large language models--in the same way people rationally swapped horse-drawn buggies for gas-powered vehicles. They're making my planet bad," he says. I connected with Vujo after digging into the strange afterlife of indie women's blog The Hairpin, which shut down in 2018. In place of the voicey, funny blog posts it was known for, the site began churning out AI-generated, search-engine-optimized pablum about dream interpretations and painfully generic relationship advice like "effective communication is vital." When I emailed an address listed on the zombie site's About Us page, Vujo responded, claiming that it was just one of more than 2,000 sites he operates, in an AI-content-fueled fiefdom built by acquiring once-popular domains fallen on hard times. He's the CEO of the digital marketing firm Shantel, which monetizes its AI-populated sites through programmatic ads, sponsored content, and selling the placement of "backlinks" to website owners trying to boost their credibility with search engines. He often targets distressed media sites because they have built-in audiences and a history of ranking highly in search results. The foundation of that business is a long-established practice known as domain squatting--buying up web domains that once belonged to established brands and profiting off their reputations with Google and other search engines. Lily Ray, senior director of SEO at the marketing agency Ampsive, calls it "the underbelly of the SEO industry." But Vujo is part of a wave of entrepreneurs giving this old trade a new twist by using generative AI. It's dusk where I live in Chicago when I talk via Zoom with Nebojša Vujinović Vujo. It's midnight in Belgrade, Serbia, where he lives with his girlfriend and their toddler, but he's wide awake and chatty. Vujo attributes his erratic sleep schedule to years of late nights working as a DJ and still makes music--he likes to mix pop with Balkan folk and is working on a new song called "Fat Lady." But right now he's eager to talk, human-to-human, about his AI-fueled hustle. He gets why writers are unhappy that their work has been erased and replaced by clickbait. But he defends his choices, pointing out that his life has been tougher than that of the average American blogger. Although ethnically Serbian, Vujo was born in what is now known as Bosnia and Herzegovina, and his family fled during the breakup of Yugoslavia. "I had two wars I escaped.
- North America > United States > Illinois > Cook County > Chicago (0.25)
- Europe > Serbia > Central Serbia > Belgrade (0.25)
- Europe > Bosnia and Herzegovina (0.25)
- North America > United States > New Jersey > Hudson County > Jersey City (0.05)
Confessions of a Viral AI Writer
Six or seven years ago, I realized I should learn about artificial intelligence. I'm a journalist, but in my spare time I'd been writing a speculative novel set in a world ruled by a corporate, AI-run government. The problem was, I didn't really understand what a system like that would look like. I started pitching articles that would give me an excuse to find out, and in 2017 I was assigned to profile Sam Altman, a cofounder of OpenAI. One day I sat in on a meeting in which an entrepreneur asked him when AI would start replacing human workers.
Confessions Of A Climate Convert - Techonomy
As my family can attest, admitting when I am wrong has never been easy for me; but I feel it is important that I share a recent, if rather late, realization I had. Before attending the Techonomy Climate event in March, I didn't fully understand the scope and importance of the climate crisis. I am sharing my experience and perspective in the hopes that I may inspire some of my like-minded peers to grasp not only the urgency of this issue but also the tremendous business opportunities it holds. Like many of my friends and colleagues, my sense of purpose has always lain with the protection and prosperity of my family, friends and employees. I think this worldview is both natural and understandable.
Could a Robot Be a Friend?
When COVID-19 first hit, I was terrified to leave my home. As the father of three and a husband of 25 years, I felt helpless to protect my family as the narrative changed seemingly day-to-day. I knew fashioning medical masks from scarves was far from ideal, so I made masks for my family, friends, and any elderly customer that wanted one, using my 3D printer and some micron-level cloth filter material intended for residential HVAC systems. Still, I felt like I had no control over what was happening. The CDC reported in 2020 that between June 24 and 30, close to 40% of adults in the U.S. reported at least one adverse mental health concern--including anxiety, depression, substance use, and suicidal ideation, among others. UK-based researchers introduced the term "COVID-19 anxiety syndrome" in Psychiatry Research, noting avoidance, worrying, daily symptom checking, and threat monitoring as key traits.
- North America > United States (0.24)
- Europe > United Kingdom > England (0.04)
- Information Technology > Communications > Social Media (0.72)
- Information Technology > Artificial Intelligence > Robots (0.50)