Goto

Collaborating Authors

 Law


ArGen: Auto-Regulation of Generative AI via GRPO and Policy-as-Code

arXiv.org Artificial Intelligence

This paper introduces ArGen (Auto-Regulation of Generative AI systems), a framework for aligning Large Language Models (LLMs) with complex sets of configurable, machine-readable rules spanning ethical principles, operational safety protocols, and regulatory compliance standards. Moving beyond just preference-based alignment, ArGen is designed to ensure LLMs adhere to these multifaceted policies through a novel synthesis of principle-based automated reward scoring, Group Relative Policy Optimisation (GRPO), and an Open Policy Agent (OPA) inspired governance layer. This approach provides the technical foundation for achieving and demonstrating compliance with diverse and nuanced governance requirements. To showcase the framework's capability to operationalize a deeply nuanced and culturally-specific value system, we present an in-depth case study: the development of a medical AI assistant guided by principles from Dharmic ethics (such as Ahimsa and Dharma), as derived from texts like the Bhagavad Gita. This challenging application demonstrates ArGen's adaptability, achieving a 70.9% improvement in domain-scope adherence over the baseline. Through our open-source repository, we show that ArGen's methodology offers a path to 'Governable Al' systems that are technically proficient, ethically robust, and verifiably compliant for safe deployment in diverse global contexts.


CARE: Decoding Time Safety Alignment via Rollback and Introspection Intervention

arXiv.org Artificial Intelligence

As large language models (LLMs) are increasingly deployed in real-world applications, ensuring the safety of their outputs during decoding has become a critical challenge. However, existing decoding-time interventions, such as Contrastive Decoding, often force a severe trade-off between safety and response quality. In this work, we propose CARE, a novel framework for decoding-time safety alignment that integrates three key components: (1) a guard model for real-time safety monitoring, enabling detection of potentially unsafe content; (2) a rollback mechanism with a token buffer to correct unsafe outputs efficiently at an earlier stage without disrupting the user experience; and (3) a novel introspection-based intervention strategy, where the model generates self-reflective critiques of its previous outputs and incorporates these reflections into the context to guide subsequent decoding steps. The framework achieves a superior safety-quality trade-off by using its guard model for precise interventions, its rollback mechanism for timely corrections, and our novel introspection method for effective self-correction. Experimental results demonstrate that our framework achieves a superior balance of safety, quality, and efficiency, attaining a low harmful response rate and minimal disruption to the user experience while maintaining high response quality.



L.A. County employee charged with alleged hate crimes against Asian co-worker

Los Angeles Times

Things to Do in L.A. Tap to enable a layout that focuses on the article. L.A. County employee charged with alleged hate crimes against Asian co-worker Bhavin Patel allegedly broke into the county's downtown headquarters, pictured above, on Aug. 25, Aug. 26 and Sept. 4 with "prepared threats," which he scattered on his co-worker's desk and nearby cubicles, according to the L.A. County District Attorney's Office. Voice comes from the use of AI. Please report any issues or inconsistencies here . A Los Angeles County employee has been charged with felony hate crimes after allegedly breaking into the government's downtown headquarters three times in the last two weeks and placing death threats on the desk of an Asian co-worker.


Cindy Cohn Is Leaving the EFF, but Not the Fight for Digital Rights

WIRED

After 25 years at the Electronic Frontier Foundation, Cindy Cohn is stepping down as executive director. In a WIRED interview, she reflects on encryption, AI, and why she's not ready to quit the battle. After a quarter century defending digital rights, Cindy Cohn announced on Tuesday that she is stepping down as executive director of the Electronic Frontier Foundation. Cohn, who has led the San Francisco-based nonprofit since 2015, says she will leave the role later this year, concluding a chapter that helped define the modern fight over online freedom. Cohn first rose to prominence as lead counsel in, the 1990s case that overturned federal restrictions on publishing encryption code. As EFF's legal director and later executive director, she guided the group through legal challenges to government surveillance, reforms to computer crime laws, and efforts to hold corporations accountable for data collection. Over the past decade, EFF has expanded its influence, becoming a central force in shaping the debate over privacy, security, and digital freedom. In an interview with WIRED, Cohn reflected on EFF's foundational encryption victories, its unfinished battles against National Security Agency (NSA) surveillance, and the organization's work protecting independent security researchers.


Does Society Have Too Many Rules?

The New Yorker

Does Society Have Too Many Rules? When regular people seem burdened by bureaucracy, and the powerful act as they choose, it's worth asking whether we've forgotten what makes rules effective. I live in a three-generation household. Our place is big, but crowded: all of us have hobbies, and so every shelf or surface contains toys, books, art supplies, sporting goods, craft projects, cameras, musical instruments, or kitchen gadgets. Before the table can be set for dinner, it must be cleared of a board game or marble run. My desk, where I aim to write in the mornings, has been repurposed as a drone-repair workshop. The property includes two broken-down sheds and a garage.


Trump EPA wants to fast track permits for AI infrastructure

Al Jazeera

Is Chicago the violent crime capital of the US? Why did India-US relations decline so fast? Why is the US expanding in the Caribbean? The United States Environmental Protection Agency has proposed new measures aimed at speeding construction of infrastructure needed for the rapid buildup of data centres for artificial intelligence that would enable companies to start building before obtaining air permits. The EPA announced its new proposal on Tuesday.


Robots to the rescue: miniature robots offer new hope for search and rescue operations

Robohub

In the critical 72 hours after an earthquake or explosion, a race against the clock begins to find survivors. When a powerful earthquake hit central Italy on 24 August 2016, killing 299 people, over 5 000 emergency workers were mobilised in search and rescue efforts that saved dozens from the rubble in the immediate aftermath. The pressure to move fast can create risks for first responders, who often face unstable environments with little information about the dangers ahead. But this type of rescue work could soon become safer and more efficient thanks to a joint effort by EU and Japanese researchers. Rescue organisations, research institutes and companies from both Europe and Japan worked together from 2019 to 2023 to develop a new generation of tools blending robotics, drone technology and chemical sensing to transform how emergency teams operate in disaster zones.


Is AI the New Frontier of Women's Oppression?

WIRED

Is AI the New Frontier of Women's Oppression? In her new book, feminist author Laura Bates explores how sexbots, AI assistants, and deepfakes are reinventing misogyny and harming women. After spending her early twenties as a nanny in the UK, Laura Bates noticed that the young girls she was caring for were preoccupied by their bodies, spurred on by the marketing they were receiving. In 2012, Bates, a London-based feminist author and activist, started The Everyday Sexism Project, a website dedicated to documenting and combatting sexism, misogyny, and gendered violence around the world by highlighting insidious instances of it such as invisible labor, referring to women as girls and commenting on their attire in professional settings. The site was turned into a book in 2014.


How Google dodged a major breakup – and why OpenAI is to thank for it

The Guardian

The reason for the relative tameness of the penalty is the emergence of real competition to Google - what the case concerned in the first place. The reason for the relative tameness of the penalty is the emergence of real competition to Google - what the case concerned in the first place. An antitrust apocalypse has been averted, and it's all down to its biggest competitor, according to the judge who could've forced Google to sell Chrome I'm your host, Blake Montgomery, writing to you as I finish the audiobook version of Don DeLillo's White Noise, which I can't say I found compelling. In tech - artificial intelligence is having its day in court with an 11th-hour appearance in Google's landmark antitrust trial and Anthropic's major settlement with book authors. Google dodged a catastrophic breakup, and it has its biggest competitor to thank for that, according to the judge who could have forced the tech giant to sell off Chrome, the most popular web browser in the world, and perhaps Android, the world's most widely used mobile operating system.